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nELD OF THE INVENTION 

The present invention relates to compositions and methods for the detection and 
characterization of nucleic acid sequences and variations in nucleic acid sequences. The 
present invention relates to methods for forming a nucleic acid cleavage structure on a 
15 target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. 
For example, in some embodiments, the 5' nuclease activity of a variety of enzymes is 
used to cleave the target-dependent cleavage structure, thereby indicating the presence 
ofspecific nucleic acid sequences or specific variations thereof. 

20 BACKGROUND OF THE INVENTION 

Methods for the detection and characterization ofspecific nucleic acid sequences 
and^uence variations have been used to detect the presence of viral or bacterial nucleic 
acid sequences indicative of an infection, to detect the presence of variants or alleles of 
genes associated with disease and cancers. These methods also find application in the 
25 identification of sources of nucleic acids, as for forensic analysis or for paternity 
determinations. 

Various methods are known to the art that may be used to detect and characterize 
specific nucleic acid sequences and sequence variants. Nonetheless, with the completion 
of the nucleic acid sequencing of the human genome, as well as the genomes of numerous 
30 pathogenic organisms, the demand for fast, reUable, cost-effective and user-fiiendly tests 
for the detection ofspecific nucleic acid sequences continues to grow. Importantly, these 
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tests must be able to create a detectable signal from samples that contain very few copies 
of the sequence of interest. The following discussion examines two levels of nucleic acid 
detection assays currently in use: I. Signal Amplification Technology for detection of 
rare sequences 11. Direct Detection Technology for quantitative detection of sequences, 
5 and HI. Direct Detection of RNA. 

L Signal Amplification Technology Methods For Amplification 

The "PolymCTase Chain Reaction" (PGR) comprises the first generation of 
methods for nucleic acid amplification. However, several other methods have been 
10 developed that employ the same basis of specificity, but create signal by different 

amplification mechanisms. These methods include the "Ligase Chain Reaction" (LCR), 
"Self-Sustained Synthetic Reaction" (3SR/NASBA), and "QP-Replicase" (Qp): 

Polymerase Chain Reaction (PCR) 

15 The polymerase chain reaction (PCR), as described in U.S. Patent Nos. 4,683,195, 

4,683,202, and 4,965,188 to MuUis and MuUis et al (the disclosures of which are hereby 
incorporated by reference), describe a method for increasing the concentration of a 
segment of target sequence in a mixture of genomic DNA without cloning or purification. 
This technology provides one approach to the problems of low target sequence 

20 concentration. PCR can be used to directly increase the concentration of the target to an 
easily detectable level. This process for amplifying the target sequence involves 
introducing a molar excess of two oligonucleotide primers that are complementary to 
their respective strands of the double-stranded target sequence to the DNA mixture 
containing the desired target sequence. The mixture is denatured and then allowed to 

25 hybridize. Following hybridization, the primers are extended with polymerase so as to 
form complementary strands. The steps of denaturation, hybridization, and polymerase 
extension can be repeated as often as needed, in order to obtain relatively high 
concentrations of a segment of the desired target sequence. 

The length of the segment of the desired target sequence is determined by the 

30 relative positions of the primers with respect to each other, and, therefore, this length is a 
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controllable parameter. Because the desired segments of the target sequence become the 
dominant sequences (in terms of concentration) in the mixture, they are said to be 
"PC3l-amplified." 

Ligase Chain Reaction (LCR or LAR) 
5 The ligase chain reaction (LCR; sometimes referred to as "Ligase Amplification 

Reaction" (LAR) described by Barany. Proc. Natl. Acad. Sci.. 88:189 (1991); Barany. 
PCRMethods and Applic, 1:5 (1991); and Wu and Wallace, Genomics 4:560 (1989) has 
developed into a well-reco^ized alternative method for amplifying nucleic acids. In 
LCR. four oligonucleotides, two adjacent ohgonucleotides that uniquely hybridize to one 
10 strand of target DNA, and a complementary set of adjacent oligonucleotides, that 
hybridize to the opposite strand are mixed and DNA ligase is added to the mixture. 
Provided that there is complete complementarity at the junction, ligase will covalently 
link each set of hybridized molecules. Importantly, in LCR. two probes are ligated 
together only when they base-pair with sequences in the target sample, without gaps or 
15 mismatches. Repeated cycles of denaturation. hybridization and ligation amplify a short 
segment of DNA. LCR has also been used in combination with PGR to achieve enhanced 
detection of single-base changes. Segev. PGT Pubhc. No. W09001069 Al (1990). 
However, because the four oligonucleotides used in this assay can pair to form two short 
Ugatable fragments, there is the potential for the generation of target-independent 
20 background signal. The use of LCR for mutant screening is limited to the examination of 
specific nucleic acid positions. 

Self-Sustained Synthetic Reaction (3SR/NASBA) 

The self-sustained sequence replication reaction (3SR) (Guatelli et al, Proc. Natl. 
Acad. Sci., 87:1874-1878 [1990], with an erratum at Proc. Natl. Acad, Sci., 87:7797 

25 [1990]) is a transcription-based in vitro amplification system (Kwok et al., Proc. Natl. 
Acad. Sci., 86:1 173-1177 [1989]) that can exponentially amplify RNA sequences at a 
uniform temperature. The amplified RNA can then be utilized for mutation detection 
(Fahy et al., PGR Meth. AppL, 1:25 [1991]). In this method, an oligonucleotide primer is 
used to add a phage RNA polymerase promoter to the 5' end of the sequence of interest. 

30 In a cocktail of enzymes and substrates that includes a second primer, reverse 
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transcriptase, RNase H, RNA polymerase and ribo-and deoxyribonucleoside 
triphosphates, the target sequence undergoes repeated rounds of transcription, cDNA 
synthesis and second-strand synthesis to ampUfy the area of interest. The use of 3SR to 
detect mutations is kinetically limited to screening small segments of DNA 200-300 
5 base pairs). 

Q-Beta (QP) Replicase 

In this method, a probe that recognizes the sequence of interest is attached to the 
replicatable RNA template for Qp replicase. A previously identified major problem with 
false positives resulting bom the replication of unhybridized probes has been addressed 

10 through use of a sequence-specific ligation step. However, available thermostable DNA 
ligases are not effective on this RNA substrate, so the ligation must be performed by T4 
DNA ligase at low temperatures {iTC). This prevents the use of high temperature as a 
means of achieving specificity as in the LCR, the ligation event can be used to detect a 
mutation at the junction site, but not elsewhere. 

1 5 Table 2 below, lists some of the features desirable for systems usefiil in sensitive 

nucleic acid diagnostics, and summarizes the abilities of each of the major amplification 
methods (See also, Landgren, Trends in Genetics 9: 199 [1993]). 

A successful diagnostic method must be very specific. A straight-forward method 
of controlling the specificity of nucleic acid hybridization is by controlling the 

20 temperature of the reaction. While the 3SR/NASB A, and Qp systems are all able to 
generate a large quantity of signal, one or more of the enzymes involved in each cannot 
be used at high temperature (/.e., >55*'C). Therefore the reaction temperatures cannot be 
raised to prevent non-specific hybridization of the probes. If probes are shortened in 
order to make them melt more easily at low temperatures, the likelihood of having more 

25 than one perfect match in a complex genome increases. For these reasons, PGR and LCR 
curroitly dominate the research field in detection technologies. 
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TABLE 1 



Feature 


Method 


PCR LCR PCR& 3SR QP 
LCR NASBA 


Amplifies Target 




+ 


+ 


+ 




Recognition of Independent 
Sequences Required 


+ 


4- 


+ 


+ 


+ 


Performed at High Temp. 


+ 


+ 








Operates at Fixed Temp. 








+ 


+ 


Exponential Amptification 


+ 


+ 


+ 




+ 


Generic Signal Generation 










+ 


Easily Automatable 













The basis of the amplification procedure in the PCR and LCR is the fact that the 
products of one cycle.become usable templates in all subsequent cycles, consequently 

5 doubling the population with each cycle. The final yield of any such doubling system can 
be expressed as: (1+X)" = y, where "X" is the mean efiBciency (percent copied in each 
cycle), "n" is the number of cycles, and "y" is the overall efficiency, or yield of the 
reaction (Mullis, PCR Methods Applic, 1:1 [1991]). If every copy of a target DNA is 
utilized as a template in every cycle of a polymerase chain reaction, then the mean 

10 efficiency is 100%. If 20 cycles of PCR are performed, then the yield will be 2^°, or 
1,048,576 copies of the starting material. If the reaction conditions reduce the mean 
efficiency to 85%, then the yield in those 20 cycles will be only 1.85^^ or 220,513 copies 
of the starting material. In other words, a PCR running at 85% efficiency will yield only 
21% as much final product, compared to a reaction running at 100% efficiency. A 
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reaction that is reduced to 50% mean efficiency will yield less than 1% of the possible 
product. 

In practice, routine polymerase chain reactions rarely achieve the theoretical 
maximum yield, and PCRs are usually run for more than 20 cycles to compensate for the 

5 lower yield. At 50% mean efficiency, it would take 34 cycles to achieve the million-fold 
amplification theoretically possible in 20, and at lower efficiencies, the number of cycles 
required becomes prohibitive. In addition, any background products that amplify with a 
better mean efficiency than the intended target will become the dominant products. 

Also, many variables can influence the mean efficiency of PCR, including target 

10 DNA length and secondary structure, primer length and design, primer and dNTP 
concentrations, and buffer composition, to name but a few. Contamination of the 
reaction with exogenous DNA DNA spilled onto lab surfaces) or cross- 
contamination is also a major consideration. Reaction conditions must be carefully 
optimized for each different primer pair and target sequence, and the process can take 

15 days, even for an experienced investigator. The laboriousness of this process, including 
numerous technical considerations and other factors, presents a significant drawback to 
using PCR in the clinical setting. Indeed, PCR has yet to penetrate the clinical market in 
a significant way. The same concerns arise with LCR, as LCR must also be optimized to 
use different oligonucleotide sequences for each target sequence. In addition, both 

20 methods require expensive equipment, capable of precise temperature cycling. 

Many applications of nucleic acid detection technologies, such as in studies of 
allelic variation, involve not only detection of a specific sequence in a complex 
background, but also the discrimination between sequences with few, or single, 
nucleotide differences. One method for the detection of allele-specific variants by PCR is 

25 based upon the fact that it is difficult for Taq polymerase to synthesize a DNA strand 
when there is a mismatch between the template strand and the 3* end of the primer. An 
allele-specific variant may be detected by the use of a primer that is perfectly matched 
with only one of the possible alleles; the mismatch to the other allele acts to prevent the 
extension of the primer, thereby preventing the amplification of that sequence. This 

30 method has a substantial limitation in that the base composition of the mismatch 

influences the ability to prevent extension across the mismatch, and certain mismatches 
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do not prevent extension or have only a minimal effect (Kwok et ai, Nucl. Acids Res., 
18:999 [1990]).) 

A similar 3*-mismatch strategy is used with greater effect to prevent ligation in the 
LCR (Barany, PGR Meth. Applic, 1:5 [1991]). Any mismatch effectively blocks the 
5 action of the thermostable ligase, but LCR still has the drawback of target-independent 
background ligation products initiating the amplification. Moreover, the combination of 
PGR with subsequent LCR to identify the nucleotides at individual positions is also a 
clearly cumbersome proposition for the clinical laboratory. 

1 0 IL Direct Detection Technology 

When a sufficient amount of a nucleic acid to be detected is available, there are 
advantages to detecting that sequence directly, instead of making more copies of that 
target, (e.g., as in PGR and LGR). Most notably, a method that does not amplify the 
signal exponentially is more amenable to quantitative analysis. Even if the signal is 

15 enhanced by attaching multiple dyes to a single oligonucleotide, the conrelation between 
the final signal intensity and amount of target is direct. Such a system has an additional 
advantage that the products of the reaction will not themselves promote further reaction, 
so contamination of lab surfaces by the products is not as much of a concern. Traditional 
methods of direct detection including Northern and Southern blotting and RNase 

20 protection assays usually require the use of radioactivity and are not amenable to 
automation. Recently devised techniques have sought to eliminate the use of 
radioactivity and/or improve the sensitivity in automatable formats. Two examples are 
the "Gycling Probe Reaction" (CPR), and "Branched DNA" (bDNA) 

The cycling probe reaction (GPR) (Duck et aL, BioTech., 9:142 [1990]), uses a 

25 long chimeric oligonucleotide in which a central portion is made of RNA while the two 
termini are made of DNA. Hybridization of the probe to a target DNA and exposure to a 
thermostable RNase H causes the RNA portion to be digested. This destabilizes the 
remaining DNA portions of the duplex, releasing the remainder of the probe fiom the 
target DNA and allowing another probe molecule to repeat the process. The signal, in the 

30 form of cleaved probe molecules, accumulates at a linear rate. While the repeating 
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pixxjess increases the signal, the RNA portion of the oligonucleotide is vulnerable to 
RNases that may be carried through sample preparation. 

Branched DNA (bDNA). described by Urdea et ai. Gene 61 :253-264 (1987), 
involves oligonucleotides with branched structures that allow each individual 
5 oligonucleotide to cany 35 to 40 labels (e.^., alkaline phosphatase en^es). While this 
enhances the signal from a hybridization event, signal from non-specific binding is 
similarly increased. 

While both of these methods have the advantages of direct detection discussed 
above, neither the CPR or bDNA methods can make use of the specificity allowed by the 

1 0 requirement of independent recognition by two or more probe (oligonucleotide) 
sequences, as is common in the signal amplification methods described in Section I. 
above. The requirement that two oligonucleotides must hybridize to a target nucleic acid 
in order for a detectable signal to be generated confers an extra measure of stringency on 
any detection assay. Requiring two oligonucleotides to bind to a target nucleic acid 

1 5 reduces the chance that false "positive" results will be produced due to the non-specific 
binding of a probe to the target. The further requirement that the two oligonucleotides 
must bind in a specific orientation relative to the target, as is required in PGR, where 
oligonucleotides must be oppositely but appropriately oriented such that the DNA 
polymerase can bridge the gap between the two oligonucleotides in both directions, 

20 further enhances specificity of the detection reaction. However, it is well known to those 
in the art that even though PGR utilizes two oligonucleotide probes (termed primers) 
"non-specific" amplification (/.e., amplification of sequences not directed by the two 
primers used) is a common artifact. This is in part because the DNA polymerase used in 
PGR can accommodate very large distances, measured in nucleotides, between the 

25 oligonucleotides and thus there is a large window in which non-specific binding of an 
oligonucleotide can lead to exponential amplification of inappropriate product. The 
LCR, in contrast, cannot proceed unless the oligonucleotides used are bound to the target 
adjacent to each other and so the full benefit of the dual oligonucleotide hybridization is 
realized. 
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An ideal direct detection method would combine the advantages of the direct 
detection assays (e.g., easy quantification and minimal risk of carry-over contamination) 
with the specificity provided by a dual oligonucleotide hybridization assay, 

5 III. Direct Detection of RN A 

In molecular medicine, a simple and cost-effective method for direct and 
quantitative RNA detection would greatly facilitate the analysis of RNA viruses and the 
measurement of specific gene expression. Both of these issues are currently pressing 
problems in the field. Despite this need, few techniques have emerged that are truly 

10 direct. PCR-based detection assays require conversion of RNA to DNA by reverse 
transcriptase before amplification, introducing a variable that can compromise accurate 
quantification. Furthermore, PCR and other methods based on exponential amphfication 
(e.g., NASBA) require painstaking containment measures to avoid cross-contamination, 
and have difficulty distinguishing small differences (e.g., 2 to 3-fold) in quantity. Other 

1 5 tests that directly examine RNA sufEa: firom a variety of drawbacks, including time 
consuming autoradiography steps (^g., RNase protection assays), or ovemight reaction 
times (e.g., branched DNA assays). With over 1.5 million viral load measurements being 
performed in the U.S. every year, there is clearly an enormous potential for an 
inexpensive, rapid, high-throughput system for the quantitative measurement of RNA. 

20 • Techniques for direct, quantitative detection of mRNA are vital for monitoring 

expression of a number of different genes. In particular, levels of cytokine expression 
(eg., interleukins and lymphokines) are being exploited as clinical measures of immune 
response in the progression of a wide variety of diseases (Van Deuren et al^ J. Int. Fed. 
Clin. Chem., 5:216 [1993], Van Deuren et ai, J. Inf Dis., 169:157 [1994], Perenboom et 

25 a/., Eur. J. Clin. Invest., 26:159 [1996], Guidotti et a/., Immunity 4:25 [1996]) as well as 
in monitoring transplant recipients (Grant et al^ Transplantation 62:910 [1996]). 
Additionally, the monitoring of viral load and identification of viral genotype have great 
clinical significance for individuals suffering viral infections by such pathogens as HTV 
or Hepatitis C virus (HCV), There is a high correlation between viral load (i.e., the 

30 absolute number of viral particles in the bloodstream) and time to progression to AIDS 
(Mellors et al. Science 272:1 167 [1996], Saag et al. Nature Medicine 2:625 [1996]). 



9 



wo 01/90337 



PCTAJSOl/17086 



For that reason^ viral load, as measured by quantitative nucleic acid based testing, is 
becoming a standard monitoring procedure for evaluating the efficacy of treatment and 
the clinical status of HIV positive patients. It is thought to be essential to reduce viral 
load as early in the course of infection as possible and to evaluate viral levels on a regular 

5 basis. In the case of HCV, viral genotype has great clinical significance, with 
correlations to severity of liver disease and responsiveness to interferon therapy. 
Furthermore, because HCV cannot be grown in culture, it is only by establishing 
correlations between characteristics like viral genotype and clinical outcome that new 
antiviral treatments can be evaluated. 

10 While the above mentioned methods have been serviceable for low throughput, 

research applications, or for limited clinical application, it is clear that large scale 
quantitative analysis of RNA readily adaptable to any genetic system will require a more 
innovative approach. An ideal direct detection method would combine the advantages of 
the direct detection assays (eg., easy quantification and minimal risk of carry-over 

1 5 contamination) with the specificity provided by a dual oligonucleotide hybridization 
assay. 

Many of the methods described above rely on hybridization alone to distinguish a 
target molecule firom other nucleic acids. Although some of these methods can be highly 
sensitive, they often cannot quantitate and distinguish closely related mRNAs accurately, 
20 especially such RNAs expressed at different levels in the same sample. While the 
above-mentioned methods are serviceable for some purposes, a need exists for a 
technology that is particularly adept at distinguishing particular RNAs from closely 
related molecules. 

25 SUMMARY OF THE INVENTION 

The present invention relates to compositions and methods for the detection and 
characterization of nucleic acid sequences and variations in nucleic acid sequences. The 
present invention relates to methods for forming a nucleic acid cleavage structure on a 
target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. 

30 For example, in some embodiments, the 5' nuclease activity of a variety of enzymes is 

10 
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used to cleave the target-dependent cleavage structure, thereby indicating the presence of 
specific nucleic acid sequences or specific variations thereof. 

The present invention provides structure-specific cleavage agents (e.g., nucleases) 
firom a variety of sources, including mesophilic, psychrophilic, thermophilic, and 
5 hyperthennophilic organisms. The preferred structure-specific nucleases are 

thermostable. Thermostable structure-specific nucleases are contemplated as particularly 
useful in that they operate at temperatures where nucleic acid hybridization is extremely 
specific, allowing for allele-specific detection (including single-base mismatches). In one 
embodiment, the thermostable structure-specific nucleases are thermostable 5' nucleases 

1 0 comprising altered polymerases derived firom the native polymerases of Thermus 

species, including, but not limited to Thermus aquaticus, Thermus flavus, and Thermus 
thermophilus. However, the invention is not limited to the use of thermostable 5' 
nucleases. Thermostable structure-specific nucleases from the FEN-1, RAD2 and XPG 
class of nucleases are also preferred. 

1 5 The present invention provides a method for detecting a target sequence (e.g., a 

mutation, polymorphism, etc), comprising providing a sample suspected of containing the 
target sequence; oligonucleotides capable of forming an invasive cleavage structure in the 
presence of the target sequence; and an agent for detecting the presence of an invasive 
cleavage structure; and exposing the sample to the oligonucleotides and the agent. In 

20 some embodiments, the method further comprises the step of detecting a complex 

comprising the agent and the invasive cleavage structure (directly or indirectly). In some 
embodiments, the agent comprises a cleavage agent. In some prefened embodiments, the 
exposing of the sample to the oligonucleotides and the agent comprises exposing the 
sample to the oligonucleotides and the agent under conditions wherein an invasive 

25 cleavage structure is formed between the target sequence and the oligonucleotides if the 
target sequence is present in the sample, wherein the invasive cleavage structure is 
cleaved by the cleavage agent to form a cleavage product. In some embodiments, the 
method further comprises the step of detecting the cleavage product. In some 
embodiments, the target sequence comprises a first region and a second region, the 

30 second region downstream of and contiguous to the first region, and wherein the 

oligonucleotides comprise first and second oligonucleotides, wherein at least a portion of 

11 
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the first oligonucleotide is completely complementary to the first portion of the target 
sequence and wherein the second oligonucleotide comprises a 3' portion and a 5' portion, 
wherein the 5' portion is completely complementary to the second portion of said target 
nucleic acid. 

5 The preseit invention also provides a kit for detecting such target sequences, said 

kit comprising oligonucleotides capable of forming an mvasive cleavage structure in the 
presence of the target sequence. In some embodiments, the kit fiirfher comprises an agent 
for detecting the presence of an invasive cleavage structure (e.g., a cleavage agent). In 
some embodiments, the oligonucleotides comprise first and second oligonucleotides, said 

10 first oligonucleotide comprising a 5* portion complementary to a first region of the target 
nucleic acid and said second oligonucleotide comprising a 3' portion and a 5' portion, said 
5' portion complementary to a second region of the target nucleic acid downstream of and 
contiguous to the first portion. In some preferred embodiments, the target sequence 
comprises 

1 5 The present invention also provides methods for detecting the presence of a target 

nucleic acid molecule by detecting non-target cleavage products comprising providing: a 
cleavage agent; a source of target nucleic acid, the target nucleic acid comprising a first 
region and a second region, the second region downstream of and contiguous to the first 
region; a first oligonucleotide, wherein at least a portion of the first oligonucleotide is 

20 completely complementary to the first portion of the target nucleic acid; and a second 
oligonucleotide comprising a 3' portion and a 5' portion, wherein the 5' portion is 
completely complementary to the second portion of the target nucleic acid; mixing the 
cleavage agent, the target nucleic acid, the first oligonucleotide and the second 
oligonucleotide to create a reaction mixture under reaction conditions such that at least 

25 the portion of the first oligonucleotide is annealed to the first region of said target nucleic 
acid and wherein at least the 5' portion of the second oligonucleotide is annealed to the 
second region of the target nucleic acid so as to create a cleavage structure, and wherein 
cleavage of the cleavage structure occurs to generate non-target cleavage product; and 
detecting the cleavage of the cleavage structure. 

30 The detection of the cleavage of the cleavage structure can be carried out in any 

maimer. In some embodiments, the detection of the cleavage of the cleavage structure 
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comprises detecting the non~target cleavage product. In yet other embodiments, the 
detection of the cleavage of the cleavage structure comprises detection of fluorescence, 
mass, or fluorescence energy transfer. Other detection methods include, but are not 
limited to detection of radioactivity, luminescence, phosphorescence, fluorescence 

5 polarization, and charge. In some embodiments, detection is carried out by a method 
comprising providing the non-target cleavage product; a composition comprising two 
single-stranded nucleic acids annealed so as to define a single-stranded portion of a 
protein binding region; and a protein; and exposing the non-target cleavage product to the 
single-stranded portion of the protein binding region imder conditions such that the 

10 protein binds to the piotem binding region. In some embodiments, the protein comprises 
a nucleic acid producing protein, wherein the nucleic acid producing protein binds to the 
protein-binding region and produces nucleic acid. In some embodiments, the protein- 
binding region is a template-dq)endent KNA polymerase binding region (e.g., a T7 RNA 
polymerase binding region). In other embodiments, the detection is carried out by a 

1 5 method comprising providing the non-target cleavage product; a single continuous strand 
of nucleic acid comprising a sequence defining a single strand of an RNA polymerase 
binding region; a template-dependent DNA polymerase; and a template-dqpendent RNA 
polymerase; exposing the non-target cleavage product to the RNA polymerase binding 
region under conditions such that the non-target cleavage product binds to a portion of 

20 the single strand of the RNA polymerase binding region to produce a bound non-target 
cleavage product; exposing the bound non-target cleavage product to the template- 
dependent DNA polymerase under conditions such that a double-stranded RNA 
polymerase binding region is produced; and exposing the double-stranded RNA 
polymerase binding region to the template-dependent RNA polymerase under conditions 

25 such that RNA transcripts are produced. In some embodiments, the method further 

comprises the step of detecting the RNA transcripts. In some embodiments, the template- 
dependent RNA polymerase is T7 RNA polymerase. 

The present invention is not limited by the nature of the 3' portion of the second 
oligonucleotide. In some preferred embodiments, the 3* portion of the second 

30 oligonucleotide comprises a 3' terminal nucleotide not complementary to the target 
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nucleic acid. In some embodiments, the 3' portion of the second oligonucleotide consists 
of a single nucleotide not complementary to the target nucleic acid. 

Any of the components of the method may be attached to a solid support. For 
example, in some embodiments, the first oligonucleotide is attached to a solid support. In 
S other embodiments, the second oligonucleotide is attached to a solid support. 

The cleavage agent can be any agent that is cq)able of cleaving invasive cleavage 
stnictures. In some preferred embodiments, the cleavage agent comprises a structure- 
specific nuclease. In particularly preferred embodiments, the structure-specific nuclease 
comprises a thermostable structure-specific nuclease (e.g., a thermostable 5' nuclease), 

10 Thermostable structure-specific nucleases include, but are not limited to, those having an 
amino acid sequence homologous to a portion of ttie amino acid sequence of a 
thermostable DNA polymerase derived from a thermophilic organism (e.g., Thermus 
aquaticus, Thermus flavus, and Thermus thermophilus). In other embodiments, the 
thermostable structure-specific nuclease comprises a nuclease firomihe FEN-1, RAD2 or 

1 5 XPG classes of nucleases, or chimerical structures containing one or more portions of any 
of the above cleavage agents. 

The method is not limited by the nature of the target nucleic acid. In some 
embodiments, the target nucleic acid is single stranded or double stranded DNA or RNA. 
In some embodiments, double stranded nucleic acid is rendered single stranded (e.g., by 

20 heat) prior to formation of the cleavage structure. In some embodiment, the source of 
target nucleic acid comprises a sample containing genomic DNA. Sample include, but 
are not limited to, blood, saliva, cerebral spinal fluid, pleural fluid, milk, lymph, sputum 
and s^nen. 

In some embodiments, the reaction conditions for tiie method comprise providmg 
25 a source of divalent cations. In some preferred embodiments, the divalent cation is 
selected fi-om the group comprising Mn2+ and Mg2+ ions. In some embodiments, the 
reaction conditions for the method comprise providing the first and the second 
oligonucleotides in concentration excess compared to the target nucleic acid. 
In some embodiments, the method further comprises providing a third 
30 oligonucleotide complementary to a third portion of said target nucleic acid upstream of 
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the first portion of the target nucleic acid, wherein the third oUgonucleotide is mixed with 
the reaction mixture. 

The present invention also provides a method for detecting the presence of a 
target nucleic acid molecule by detecting non-target cleavage products comprising 

5 providing: a cleavage agent; a source of target nucleic acid, the target nucleic acid 
comprising a first region and a second region, the second region downstream of and 
contiguous to the first region; a plurality of first oligonucleotides, wherein at least a 
portion of the first oligonucleotides is completely complementary to the first portion of 
the target nucleic acid; a second oligonucleotide comprising a 3' portion and a 5* portion, 

10 wherein said 5' portion is completely complementary to the second portion of the target 
nucleic acid; mixing the cleavage agent, the target nucleic acid, the plurality of first 
oligonucleotides and second oligonucleotide to create a reaction mixture under reaction 
conditions such that at least the portion of a first oligonucleotide is annealed to the first 
region of the target nucleic acid and wherein at least the 5* portion of the second 

15 oUgonucleotide is annealed to the second region of the target nucleic acid so as to create a 
cleavage structure, and wherein cleavage of the cleavage structure occurs to generate 
non-taf get cleavage product, wherein the conditions permit multiple cleavage structures 
to form and be cleaved bom the target nucleic acid; and detecting the cleavage of said 
cleavage structures. In some embodiments, the conditions comprise isothermal 

20 conditions ttiat permit the plurality of first oligonucleotides to dissociate firom the target 
nucleic acid. While the present invention is limited by the number of cleavage structure 
formed on a particular target nucleic acid, in some preferred embodiments, two or more 
(3, 4, 5, . . ., 10, . . 10000, . . .) of the plurality of first oligonucleotides form cleavage 
structures with a particular target nucleic acid, wherein the cleavage structures are 

25 cleaved to produce the non-target cleavage products. 

The presCTt invention also provides methods wherein a cleavage product firom the 
above methods is used in a fiirther invasive cleavage reaction. For example, the present 
invention provides a method comprising providing a cleavage agent; a first target nucleic 
acid, the first target nucleic acid comprismg a first region and a second region, the second 

30 region downstream of and contiguous to the first region; a first oligonucleotide, wherein 
at least a portion of the first oligonucleotide is completely complementary to the first 
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portion of the first target nucleic acid; a second oligonucleotide comprising a 3* portion 
and a 5' portion, wherein the 5' portion is completely complementary to the second 
portion of the first target nucleic acid; a second target nucleic acid, said second target 
nucleic acid comprising a first region and a second region, the second region downstream 

5 of and contiguous to the first region; and a third oligonucleotide, wherein at least a 
portion of the third oligonucleotide is completely complementary to the first portion of 
the second target nucleic acid; generating a first cleavage structure wherein at least said 
portion of the first oligonucleotide is annealed to the first region of the first target nucleic 
acid and wherein at least the 5' portion of the second oligonucleotide is annealed to the 

10 second region of the first target nucleic acid and wherein cleavage of the first cleavage 
structure occurs via the cleavage agent thereby cleaving the*first oligonucleotide to 
generate a fourth oligonucleotide, said fourth oligonucleotide comprising a 3' portion and 
a 5" portion, wherein the 5* portion is completely complementary to the second portion of 
the second target nucleic acid; generating a second cleavage structure under conditions 

1 5 wherein at least said portion of the third oligonucleotide is annealed to the first region of 
the second target nucleic acid and wherein at least the 5* portion of the fourth 
oligonucleotide is annealed to the second region of the second target nucleic acid and 
wherein cleavage of the second cleavage structure occurs to generate a cleavage 
fragment; and detecting the cleavage of the second cleavage structure. In some preferred 

20 embodiments, the 3* portion of the fourth oligonucleotide comprises a 3* terminal 

nucleotide not complementary to the second target nucleic acid. In some embodiments, 
the 3' portion of the third oligonucleotide is covalently linked to the second target nucleic 
acid. In some embodiments, the second target nucleic acid further comprises a 5' region, 
wherein the 5' region of the second target nucleic acid is the third oligonucleotide. 

25 The present invention further provides kits comprising: a cleavage agent; a first 
oligonucleotide comprising a 5' portion complementary to a first region of a target 
nucleic acid; and a second oligonucleotide comprising a 3' portion and a 5' portion, said 5' 
portion complementary to a second region of the target nucleic acid downstream of and 
contiguous to the first portion. In some embodiments, the 3' portion of the second 

30 oligonucleotide comprises a 3' terminal nucleotide not complementary to tiie target 
nucleic acid. In preferred embodiments, the 3' portion of the second oligonucleotide 
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consists of a single nucleotide not complementary to the target nucleic acid. In some 
embodiments, the kit further comprises a solid support. For example, in some 
embodiments, the first and/or second oligonucleotide is attached to said solid support. In 
some embodiments, the kit further comprises a buffer solution. In some preferred 
S embodiments, the buffer solution comprises a source of divalent cations (e.g., Mn^"*" 
and/or Mg^"^ ions). In some specific embodiments, the kit further comprises a third 
oligonucleotide complementary to a third portion of the target nucleic acid upstream of 
the first portion of the first target nucleic acid. In yet other embodiments, the kit further 
comprises a target nucleic acid. In some embodiments, the kit further comprises a second 

10 target nucleic acid. In yet oflier embodiments, the kit further comprises a third 

oligonucleotide comprising a 5' portion complementary to a first region of the second 
target nucleic acid. In some specific embodiments, the 3' portion of the third 
oligonucleotide is covalently linked to the second target nucleic acid. In other specific 
embodiments, the second target nucleic acid further comprises a S' portion, wherein the S' 

15 portion of the second target nucleic acid is the third oligonucleotide. In still other 
embodiments, the kit fiirther comprises an ARRESTOR molecule (e,g., ARRESTOR 
oligonucleotide). 

The present invention further provides a composition comprising a cleavage 
structure, the cleavage structure comprising: a) a target nucleic acid, the target nucleic 

20 acid having a first region, a second region, a third region and a fourth region, wherein the 
first region is located adjacent to and downstream from the second region, the second 
region is located adjacent to and downstream firom the third region and the third region is 
located adjacent to and downstream from the fourth region; b) a first oligonucleotide 
complementary to the fourth region of the target nucleic acid; c) a second oligonucleotide 

25 having a 5* portion and a 3* portion wherein the 5' portion of the second oligonucleotide 
contains a sequence complementary to the second region of the target nucleic acid and 
wherein the 3' portion of the second oligonucleotide contains a sequence complementary 
to the third region of the target nucleic acid; and d) a third oligonucleotide having a 5' 
portion and a 3' portion wherein the 5* portion of the third oligonucleotide contains a 

30 sequence complementary to the first region of the target nucleic acid and wherein the 3' 
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portion of the third oligonucleotide contains a sequence complementary to the second 
region of the target nucleic acid. 

The present invention is not limited by the length of the four regions of the target 
nucleic acid. In one embodiment, the first region of die target nucleic acid has a length of 
5 1 1 to SO nucleotides. In another embodim^t, the second region of the target nucleic acid 
has a length of one to three nucleotides. In another embodiment, the diird region of the 
target nucleic acid has a length of six to nine nucleotides. In yet another embodiment, the 
fourdi region of the target nucleic acid has a length of 6 to 50 nucleotides. 

The invention is not limited by the nature or composition of the of the first, 

1 0 second, third and fourth oligonucleotides; these oligonucleotides may comprise DNA, 
RNA, PNA and combinations thereof as well as comprise modified nucleotides, universal 
bases, adducts, etc. Further, one or more of the first, second, diird and the fourth 
oligonucleotides may contain a dideoxynucleotide at the 3' terminus. 

In one preferred embodiment, the target nucleic acid is not completely 

1 5 complementary to at least one of the first, the second, the third and the fourth 

oligonucleotides. In a particularly preferred embodiment, the target nucleic acid is not 
completely complementary to the second oligonucleotide. 

As noted above, the present invention contemplates the use of structure-specific 
nucleases in detection methods. In one embodiment, the present invention provides a 

20 method of detecting the presence of a target nucleic acid molecule by detecting non- 
target cleavage products comprising: a) providing: i) a cleavage means, ii) a source of 
target nucleic acid, the target nucleic acid having a first region, a second region, a third 
region and a fourth region, wherein the first region is located adjacent to and downstream 
fiiom the second region, the second region is located adjacent to and downstream Scorn the 

25 third region and the third region is located adjacent to and downstream firom the fourth 
region; iii) a first oligonucleotide complementary to the fourth region of the target nucleic 
add; iv) a second oligonucleotide having a 5' portion and a 3' portion wherein, the 5' 
portion of the second oligonucleotide contains a sequence complementary to the second 
region of the target nucleic acid and wherein the 3' portion of the second oligonucleotide 

30 contains a sequence complementary to the third region of the target nucleic acid; iv) a 
third oligonucleotide having a 5' and a 3* portion wherein the 5' portion of the third 
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oligonucleotide contains a sequence complementary to the first region of the target 
nucleic acid and wherein the 3' portion of the third oligonucleotide contains a sequence 
complementary to the second region of the target nucleic acid; b) mixing the cleavage 
means, the target nucleic acid, the first oligonucleotide, the second oligonucleotide and 
the third oligonucleotide to create a reaction mixture under reaction conditions such that 
the first oligonucleotide is annealed to the fourth region of the target nucleic acid and 
wherein at least the 3' portion of the second oUgonucleotide is annealed to the target 
nucleic acid and wherein at least the 5' portion of the third oligonucleotide is annealed to 
the target nucleic acid so as to create a cleavage structure and wherein cleavage of the 
cleavage structure occurs to generate non-target cleavage products, each non-target 
cleavage product having a 3'-hydroxyl group; and c) detecting the non-target cleavage 
products. 

The invention is not limited by the nature of the target nucleic acid. In one 
embodiment, the target nucleic acid comprises single-stranded DNA. In another 
embodiment, the target nucleic acid comprises double-stranded DNA and prior to step c), 
the reaction mixture is treated such that the double-stranded DNA is rendered 
substantially single-stranded. In another embodiment, the target nucleic acid comprises 
RNA and the first and secon^d oligonucleotides comprise DNA. 

The invention is not limited by the nature of the cleavage means. In one 
embodiment, the cleavage means is a structure-specific nuclease; particularly preferred 
structure-specific nucleases are thermostable structure-specific nucleases. 

In another preferred embodiment the thermostable structure specific nuclease is a 
chimerical nuclease. 

In an alternative preferred embodiment, the detection of the non-target cleavage 
products comprises electrophoretic separation of the products of the reaction followed by 
visualization of the separated non-target cleavage products. 

In another preferred embodiment, one or more of the first, second, and third 
oligonucleotides contain a dideoxynucleotide at the 3' terminus. When 
dideoxynucleotide-contauiing oligonucleotides are employed, the detection of the non- 
target cleavage products preferably comprises: a) incubating the non-target cleavage 
products with a template-independent polymerase and at least oiie labeled nucleoside 
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triphosphate under conditions such that at least one labeled nucleotide is added to the 3 - 
hydroxyl group of the non-target cleavage products to generate labeled non-target 
cleavage products; and b) detecting the presence of the labeled non-target cleavage 
products. The invention is not limited by the nature of the template-independent 

5 polymerase employed; in one embodiment, the template-independent polymerase is 
selected from the group consisting of terminal deoxynucleotidyl transferase (TdT) and 
poly A polymerase. When TdT or polyA polymerase are employed in the detection step, 
the second oligonucleotide may contain a 5' end label, the 5' end label being a diffwent 
label than the label present upon the labeled nucleoside triphosphate. The invention is 

10 not limited by the nature of the 5* end label; a wide variety of suitable 5' end labels are 
known to the art and include biotin, fluorescein, tetrachlorofluorescein, 
hexachlorofluorescein, Cy3 amidite, CyS amidite and digoxigenin. 

In another embodiment, detecting the non-target cleavage products comprises: a) 
incubating the non-target cleavage products with a template-independent polymerase and 

1 5 at least one nucleoside triphosphate under conditions such that at least one nucleotide is 
added to the 3'-hydroxyl group of the non-target cleavage products to generate tailed non- 
target cleavage products; and b) detecting the presence of the tailed non-target cleavage 
products. The invention is not limited by the nature of the template-independent 
polymerase employed; in one embodiment, the template-independent polymerase is 

20 selected from the group consisting of teminal deoxynucleotidyl transferase (TdT) and 
poly A polymerase. When TdT or polyA polymerases are employed in the detection step, 
the second oligonucleotide may contain a 5' end label. The invention is not limited by the 
nature of flie 5' end label; a wide variety of suitable 5* end labels are known to the art and 
include biotin, fluorescein, tetrachlorofluorescem, hexachlorofluorescein, Cy3 amidite, 

25 CyS amidite and digoxigenin. 

In a preferred OTibodiment, the reaction conditions comprise providing a source of 

divalent cations; particularly preferred divalent cations are Mn2+ and Mg^"*" ions. 

The present invention further provides a method of detecting the presence of a 
target nucleic acid molecule by detecting non-target cleavage products comprising; a) 
30 providing; i) a cleavage means, ii) a source of target nucleic acid, the target nucleic acid 
having a first region, a second region and a third region, wherein the first region is 
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located adjacent to and downstream from the second region and wherein the second 
region is located adjacent to and downstream from the third region; iii) a first 
oligonucleotide having a 5* and a 3* portion wherein the 5' portion of the first 
oligonucleotide contains a sequence complementary to the second region of the target 
nucleic acid and wherein the 3' portion of the first oUgonucleotide contains a sequence 
complementary to the third region of the target nucleic acid; iv) a second oligonucleotide 
having a length between eleven to fifteen nucleotides and further having a 5* and a 3* 
portion wherein the 5' portion of the second oligonucleotide contains a sequence 
complementary to the first region of the target nucleic acid and wherein the 3' portion of 
the second oligonucleotide contains a sequence complementary to the second region of 
the target nucleic acid; b) mixing the cleavage means, the target nucleic acid, the first 
oligonucleotide and the second oligonucleotide to create a reaction mixture under 
reaction conditions such that at least the 3' portion of the first oligonucleotide is annealed 
to the target nucleic acid and wherein at least the 5' portion of the second oligonucleotide 
is annealed to the target nucleic acid so as to create a cleavage structure and wherein 
cleavage of the cleavage structure occurs to generate non-target cleavage products, each 
non-target cleavage product having a 3'-hydroxyl group; and c) detecting the non-target 
cleavage products. In a preferred embodiment the cleavage means is a structure-specific 
nuclease, preferably a thermostable structure-specific nuclease. 

The invention is not limited by the length of the various regions of the target 
nucleic acid. In a preferred embodiment, the second region of the target nucleic acid has 
a length between one to five nucleotides. In another preferred embodiment, one or more 
of the first and the second oligonucleotides contain a dideoxynucleotide at the 3* 
terminus. When dideoxynucleotide-containing oligonucleotides are employed, the 
detection of the non-target cleavage products preferably comprises: a) incubating the 
non-target cleavage products with a template-independent polymerase and at least one 
labeled nucleoside triphosphate under conditions such that at least one labeled nucleotide 
is added to the 3 -hydroxyl group of the non-target cleavage products to generate labeled 
non-target cleavage products; and b) detecting the presence of the labeled non-target 
cleavage products. The invention is not limited by the nature of the template-independent 
polymerase employed; in one embodimmt, the template-independent polymerase is 
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selected from the group consisting of terminal deoxynucleotidyl transferase (TdT) and 
poly A polymerase. When TdT or polyA polymerase is employed in the detection step, 
the second oligonucleotide may contain a 5* end label, the 5' end label being a different 
label than the label present upon the labeled nucleoside triphosphate. The invention is 
5 not limited by the nature of the 5' end label; a wide variety of suitable 5' end labels are 
known to the art and include biotin, fluorescein, tetrachlorofluorescein, 
hexachlorofluorescein, Cy3 amidite, Cy5 amidite and digoxigenin. 

In another embodiment, detecting the non-target cleavage products comprises: a) 
incubating the non-target cleavage products with a template-independent polymerase and 

10 at least one nucleoside triphosphate under conditions such that at least one nucleotide is 
added to the 3**hydroxyl group of the non-target cleavage products to generate tailed non- 
target cleavage products; and b) detecting the presence of the tailed non-target cleavage 
products. The invention is not limited by the nature of the template-independent 
polymerase employed; in one embodiment, the template-independent polymerase is 

1 5 selected from the group consisting of terminal deoxynucleotidyl transferase (TdT) and 
poly A polymerase. When TdT or polyA polymerases are employed in the detection step, 
the second oligonucleotide may contain a 5* end label. The invention is not limited by the 
nature of the S' end label; a wide variety of suitable S' end labels are known to the art and 
include biotin, fluorescein, tetrachlorofluorescein, hexachlorofluorescein, Cy3 amidite, 

20 CyS amidite and digoxigenin. 

The novel detection methods of the invention may be employed for the detection 
of target DNAs and RNAs including, but not limited to, target DNAs and RNAs 
comprising wild type and mutant alleles of genes, including graes fi:om humans or other 
animals that are or may be associated with disease or cancer. In addition, the methods of 

25 the invention may be used for the detection of and/or identification of strains of 

microorganisms, including bacteria, frmgi, protozoa, ciliates and viruses (and in particular 
for the detection and identification of RNA viruses, such as HCV). 

The present invention ftirther provides novel enzymes designed for direct 
detection, characterization and quantitation of nucleic acids, particularly RNA. The 

30 present invention provides enzymes that recognize specific nucleic acid cleavage 

structures formed on a target RNA sequence and that cleave the nucleic acid cleavage 
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Structure in a site-specific manner to produce non-target cleavage products. The present 
invention provides enzymes having an improved ability to specifically cleave a DNA 
manberof a complex comprising DNA and RNA nucleic acid strands. 

For example, the present invention provides DNA polymerases that are altered in 
structure relative to the native DNA polymerases, such that they exhibit altered (eg., 
improved) poformance in detection assays based on the cleavage of a structure 
comprising nucleic acid {eg. , RNA). In particular, the altered polymerases of the present 
invention exhibit improved performance in detection assays based on the cleavage of a 
DNA member of a cleavage struchire (eg., an invasive cleavage structure) that comprises 
an RNA target strand. 

The improved perforaiance in a detection assay may arise from any one of, or a 
combination of several improved featiires. For example, in one embodiment, the enzyme 
of tiie presMit invention may have an improved rate of cleavage (kcai) on a specific 
targeted structure, such tiiat a larger amount of a cleavage product may be produced in a 
given time span. In anotiier embodiment, tiie enzyme of tiie present invention may have 
a reduced activity or rate in the cleavage of inappropriate or non-specific structoires. For 
example, in certain embodiments of tiie present invention, one aspect of improvement is 
that tiie differential between the detectable amount of cleavage of a specific structure and 
tiie detectable amount of cleavage of any alternative structures is increased. As such, it is 
witfiin tiie scope of tiie present invention to provide an enzyme having a reduced rate of 
cleavage of a specific Uurget structure compared to tiie rate of tiie native enzyme, and 
having a further reduced rate of cleavage of any alternative structoires, such that the 
differential between ttie detectable amount of cleav^e of flie specific structure and flie 
detectable amount of cleavage of any alternative structoires is increased. However, flie 
present invention is not limited to raizymes tiiat have an improved differential. 

In a preferred embodiment, the enzyme of tiie presait invention is a DNA 
polymerase having an altered nuclease activity as described above, and also having 
altered syntiietic activity, compared to tiiat of any native DNA polymerase from which 
tiie enzyme has been derived. It is especially preferred tiiat tiie DNA polymerase is 
altered such tiiat it exhibits reduced synthetic activity as well as improved nuclease 
activity on RNA targets, compared to tiiat of tiie native DNA polymerase. Enzymes and 
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genes oicoding enzymes having reduced synthetic activity have been described {See eg., 
Kaiser etaL, J. Biol. Chem., 274:21387 [1999], Lyamichev et al, Prot. Natl. Acad. Sci., 
96:6143 [1999], US. Patents 5,541,311, 5,614,402, 5,795,763 and 6,090,606, 
incorporated herein by reference in their entireties). The present invention contemplates 
5 combined modifications, such that the resulting 5' nucleases are without interfering 
synthetic activity, and have improved performance in RNA detection assays. 

The present invention contemplates a DNA sequence encoding a DNA 
polymerase altered in sequence relative to the native sequence, such that it exhibits 
altered nuclease activity from that of the native DNA polymerase. For example, in one 

1 0 embodiment, the DNA sequence encodes an enzyme having an improved rate of cleavage 
(kcat) on a specific targeted structure, such that a larger amount of a cleavage product may 
be produced in a given time span. In another embodiment, the DNA encodes an enzyme 
having a reduced activity or rate in the cleavage of inappropriate or non-specific 
structures. In certain embodiments, one aspect of improvement is that the differential 

1 5 between the detectable amount of cleavage of a specific structure and the detectable 

amount of cleavage of any alternative structures is increased. It is within the scope of the 
present invention to provide a DNA encoding an enzyme having a reduced rate of 
cleavage of a specific target structure compared to the rate of the native enzyme, and 
having a fiirther reduced rate of cleavage of any alternative structures, such that the 

20 differential between the detectable amount of cleavage of the specific structure and the 
detectable amount of cleavage of any alternative structures is increased. However, the 
present invention is not limited to poljonerases that have an improved differential. 

In a preferred embodiment, the DNA sequence encodes a DNA polymerase 
having the altered nuclease activity described above, and also having altered synthetic 

25 activity, compared to that of any native DNA polymerase from which the improved 

enzyme is derived. It is especially preferred that the encoded DNA polymerase is altered 
such that it exhibits reduced synthetic activity as well as improved nuclease activity on 
RNA targets, compared to that of the native DNA polymerase. 

It is not intended that the invention be limited by the nature of the alteration 

30 required to introduce altered nuclease activity. Nor is it intended that the invention be 
limited by the extent of either the alteration, or in the improvement observed. If the 
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polymerase is also altered so as to be synthesis modified, it is not intended that the 
invention be limited by the polymerase activity of the modified or unmodified protein, or 
by the nature of the alteration to rendw the polymerase synthesis modified. 

The present invaition contemplates structure-^ecific nuclease from a variety of 
sources, including, but not limited to, mesophilic, psychrophiUc. thermophilic, and 
hyperthermophilic organisms. The preferred structure-specific nucleases are 
thermostable. Thermostable structure-specific nucleases are contemplated as particularly 
usefiil in that they allow the INVADER assay {See e.g., U.S. Pat. Nos. 5,846,717, 
5,985,557, 5,994,069, 6,001,567, and 6,090,543 and PCT Publications WO 97/27214 and 
WO 98/42873, incorporated herein by reference in their entireties) to be oporated near the 
melting temperature (T„) of the downstream probe Sligonucleotide, so that cleaved and 
uncleaved probes may cycle on and off the target during the course of the reaction. In 
one embodiment, the thermostable structure-specific enzymes are thermostable 5' 
nucleases that are selected firom the group comprising altered polymerases derived fiom 
flie native polymerases of Thermus species, including, but not limited to, nemus 
aqmticus, Themus flams, nermus themophilus, Thermus filiformus, and Thermus 
scotoductus. However, flie invention is not limited to the use of thermostable 5' 
nucleases. For example, certain embodiments of the present invention utilize short 
oligonucleotide probes that may cycle on and off of the target at low tanperatures, 
allowing the use of non-thermostable enzymes. 

In some preferred embodiments, the present invention provides a composition 
comprising an enzyme, wherein &e enzyme comprises a hetaologous fiuictional domain, 
wherein the hetax)logous functional domain provides altered (e.g., improved) 
fimctionality in a nucleic acid cleavage assay. The present mvention is not limited by the 
nature of the nucleic acid cleavage assay. For example, nucleic acid cleavage assays 
include any assay in which a nucleic acid is cleaved, directly or indirectly, in the presence 
of the enzyme. In certain preferred embodiments, the nucleic acid cleavage assay is an 
invasive cleavage assay. In particularly preferred embodiments, the cleavage assay 
utilizes a cleavage structure having at least one RNA component In another particularly 
preferred embodiment, the cleavage assay utiUzes a cleavage structure having at least one 
RNA component, wherein a DNA member of the cleavage structure is cleaved. 
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In some preferred embodiments, the enzyme comprises a 5* nuclease or a 
polymerase. In certain preferred embodiments, the 5* nuclease comprises a thermostable 
5' nuclease. In other preferred embodiments, the polymerase is altered in sequence 
relative to a naturally occurring sequence of a polymerase such that it exhibits reduced 
5 DNA synthetic activity from that of the naturally occurring polymerase. In certain 
preferred embodiments, the polymerase comprises a thermostable polymerase a 
polymerase from a Thermus species including, but not limited to, Thermus aquaticus, 
Thermus flavus, Thermus thermophiluSy Thermus filiformus, and Thermus scotoductus). 
The present invention is not limited by the nature of the altered functionality 

10 provided by the heterologous functional domain. Illustrative examples of alterations 
include, but are not limited to, enzymes where the heterologous functional domain 
comprises an amino acid sequence (e.g., one or more amino acids) that provides an' 
improved nuclease activi^, an improved substrate binding activity and/or improved 
background specificity in a nucleic acid cleavage assay. 

15 The present invention is not limited by the nature of the heterologous functional 

domain. For example, in some embodiments, the heterologous functional domain 
comprises two or more amino acids from a polymerase domain of a polymerase (e.g,, 
introduced into the enzyme by insertion of a chimeric functional domain or created by 
mutation). In certain preferred embodiment, at least one of the two or more amino acids 

20 is from a palm or thumb region of the polymerase domain. The present invention is not 
limited by the identity of the polymerase from which the two or more amino acids are 
selected. In certain preferred embodiments, the polymerase comprises Thermus 
thermophilus polymerase. In particularly preferred embodiments, the two or more amino 
acids are from amino acids 300-650 of SEQ ID N0:1. 

25 The novel enzymes of the invention may be employed for the detection of target 

DNAs and RNAs including, but not limited to, target DNAs and RNAs comprising wild 
type and mutant alleles of genes, including, but not limited to, genes from hurnans, other 
animal, or plants that are or may be associated with disease or other conditions. In 
addition, the enzymes of the invention may be used for the detection of and/or 

30 identification of strains of microorganisms, including bacteria, fungi, protozoa, ciliates 
and viruses (and in particular for the detection and identification of viruses having RNA 
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genomes, such as the Hepatitis C and Human Immunodeficiency viruses). For example, 
the present invention provides methods for cleaving a nucleic acid comprising providing: 
an en^e of the presait invoition and a substrate nucleic acid; and exposing the 
substrate nucleic acid to the enzyme (e.g., to produce a cleavage product that may be 
5 detected). 

In one embodiment, the present invention provides a thermostable 5' nuclease 
having an amino acid sequence selected from the group comprising SEQ ID N0S:2, 3, 4, 
5. 6, 7, 8, 9, 10, II 12. 13. 14, 15, 16. 17, 18. 19, 20, 21, 22, 23, 24. 25, 26, 27, 28, 29, 30, 
31, 32. 33. 34, 35. 36, 37, 38, 39, 40, 41, 42. 43, 44. 45, 46, 47. 48, 49, 50, 51, 52. 53, 54, 
10 55. 56, 57, 58. 59, 60, 61, 62, 63. 64, 65, 66. 67, 68. 341, 346, 348. 351, 353, 359, 365. 
■ 367. 369, 374. 376, 380. 384, 388, 392, 396. 400, 402, 406, 408, 410, 412, 416. 418. 420. 
424, 427, 429. 432. 436, 440. 444, 446, 448. 450. 456. 460. 464, 468, 472, 476, 482, 485, 
488, 491, 494, 496, 498, 500, 502, 506, 510. 514. 518. 522, 526. 530, 534, 538. 542, 544. 
550. 553, 560. 564. 566. 568. 572, 574. 576, 578, 580. 582. 584. 586, 588, and 590. In 
15 another embodiment, the 5' nuclease is encoded by a DNA sequence selected from the 
group comprising of SEQ ID NOS:69, 70. 71, 72, 73, 74, 75, 76, 77, 78, 79, 80. 81, 82, 
83, 84, 85, 86, 87, 88, 89. 90, 91. 92, 93, 94. 95, 96, 97, 98. 99, 100, 101, 102, 103, 104, 
105. 106, 107, 108, 109, 110, 111, 112, 113. 114. 115, 116, 117, 118, 119, 120, 121. 122, 
123, 124, 125, 126, 127, 128, 129, 130. 131, 132, 133, 134, 135. 340, 345, 347, 350, 352, 
20 358, 364, 366, 368, 373, 375, 379. 383, 387, 391, 395, 399, 401, 405, 407. 409, 411, 415, 
417, 419, 423, 426, 428, 431, 435, 439, 443, 445, 447, 449, 452, 454. 455, 459, 463, 467, 
471, 475, 481, 484, 495, 497, 499, 501, 505, 509, 513, 517, 521, 525, 529, 533, 537, 541, 
543. 549, 552. 559, 563, 565, 567, 571, 573, 575, 577. 579, 581, 583, 585, 587, and 589. 
The present invention also provides a recombinant DNA vector comprising DNA 
25 having a nucleotide sequence encoding a 5' nuclease, the nucleotide sequence selected 
from the group comprising SEQ ID NOS: 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 
81, 82. 83, 84, 85, 86, 87, 88. 89, 90, 91, 92, 93. 94, 95. 96. 97. 98, 99. 100, 101, 102, 
103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114. 115, 116, 117, 118, 119, 120. 
121. 122, 123, 124, 125, 126, 127. 128. 129, 130, 131, 132, 133, 134, 135, 340, 345, 347, 
30 350, 352, 358, 364, 366, 368, 373, 375, 379, 383, 387, 391, 395, 399, 401, 405, 407, 409. 
411. 415. 417. 419, 423. 426. 428. 431. 435. 439, 443, 445, 447, 449, 452, 454, 455, 459, 
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463, 467, 471, 475, 481. 484, 495, 497, 499, 501, 505, 509, 513, 517, 521, 525, 529, 533, 
537, 541, 543, 549, 552, 559, 563, 565, 567, 571, 573, 575, 577, 579, 581, 583, 585, 587, 
and 589. In a preferred embodiment, the invention provides a host cell transformed with 
a recombinant DNA vector comprising DN A having a nucleotide sequence encoding a 

5 structure-specific nuclease, the nucleotide selected from the group comprising sequence 
SEQID NOS: 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79. 80, 81, 82, 83, 84, 85, 86, 87, 88, 
89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99. 100, 101, 102, 103, 104, 105, 106, 107, 108, 
109. 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 
127, 128, 129, 130, 131, 132, 133, 134. 135. 340, 345, 347, 350, 352, 358, 364, 366, 368, 

10 373, 375, 379, 383, 387, 391, 395, 399, 401. 405, 407, 409, 411, 415, 417, 419, 423, 426, 
428, 431, 435, 439, 443, 445, 447, 449. 452. 454, 455, 459, 463, 467, 471, 475, 481, 484, 
495, 497, 499, 501, 505, 509. 513, 517, 521, 525, 529. 533, 537, 541. 543, 549, 552, 559, 
563, 565, 567, 571, 573, 575, 577, 579, 581. 583, 585, 587. and 589. The invention is not 
limited by the nature of the host cell employed. The art is well aware of expression 

1 5 vectors suitable for the expression of nucleotide sequences encoding structure-specific 
nucleases that can be expressed in a variety of prokaryotic and eukaryotic host cells. In a 
preferred embodiment, the host cell is an Escherichia coli cell. 

The present invention provides a method of altering 5' nuclease enzymes relative 
to native 5' nuclease enzymes, such that they exhibit unproved performance in detection 

20 assays based on the cleavage of a structure comprising KNA. In particular, the altered 5' 
nucleases produced by the method of the present invention exhibit improved performance 
in detection assays based on the cleavage of a DNA member of a cleavage structure (eg., 
an invasive cleavage structure) that comprises an RNA target strand. The improved 5* 
nucleases resulting from the methods of the present invention may be improved in any of 

25 the ways discussed herein. Examples of processes for assessing improvement in any 
candidate enzyme are provided. 

For example, the present invention provides methods for producing an altered 
enzyme with improved functionality in a nucleic acid cleavage assay comprising: 
providing an enzyme and a nucleic acid test substrate; introducing a heterologous 

30 iunctional domain into the enzyme to produce an altered enzyme; contacting the altered 
enzyme with the nucleic acid test substrate to produce cleavage products; and detecting 
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the cleavage products. In some embodiments, the introduction of the heterologous 
functional domain comprises mutating one or more amino acids of the enzyme. In other 
embodiments, the introduction of the heterologous functional domain into the enzyme 
comprises adding a functional domain fiom a protein (e.g.. another enzyme) into the 
5 enzyme {e.g., substituting functional domains by removing a portion of the enzyme 
sequence prior to adding the functional domain of the protein). In preferred 
embodiments, the nucleic acid test substrate comprises a cleavage structure. In 
particularly preferred embodiment, the cleavage structure comprises an RNA target 
nucleic acid. In yet other preferred embodiments, the cleavage structure comprises an 

1 0 invasive cleavage structure. 

The present invention also provides nucleic acid treatment kits. One preferred 
«nbodiment is a kit comprising a composition comprising at least one improved 5' 
nuclease. Another prefenred embodiment provides a kit comprising: a) a composition 
comprising at least one improved 5' nuclease; and b) an INVADER oligonucleotide and a 
15 signal probe oligonucleotide. In some embodiments of the kits of the present invention, 
the improved 5' nuclease is derived from a DNA polymerase from a eubacterial species. 
In furthw embodiments, the eubacterial species is a thermophile. hi still further 
embodiments, the theimophile is of the genus Thermus. In still further embodiments, the 
thermophile is selected from the group consisting of nermus aquaticus, Thermus flavus, 
20 Thermus thermophilus. Thermus filiformus, and Thermus scotoductus. In preferred 
embodiments, the improved 5' nuclease is encoded by DNA selected from the group 
comprising SEQ ID NOS: 69, 70. 71. 72, 73, 74, 75, 76, 77, 78, 79, 80. 81, 82, 83, 84, 85, 
86, 87. 88, 89, 90, 91, 92, 93, 94, 95. 96, 97, 98, 99, 100, 101, 102, 103, 104. 105, 106, 
107. 108, 109. 110. 111. 112. 113, 114, 115, 116. 117. 118, 119, 120, 121. 122. 123, 124, 
25 125, 126. 127, 128. 129, 130. 131, 132, 133. 134, 135, 340, 345. 347, 350, 352, 358, 364, 
366, 368, 373, 375, 379, 383, 387, 391, 395, 399. 401, 405, 407, 409, 411, 415. 417. 419, 
423, 426, 428, 431, 435, 439, 443, 445, 447, 449, 452, 454, 455. 459. 463, 467, 471, 475, 
481, 484, 495, 497, 499, 501, 505, 509, 513, 517, 521, 525, 529, 533, 537, 541, 543, 549, 
552, 559, 563, 565, 567, 571, 573, 575, 577, 579, 581, 583, 585, 587, and 589. In yet 
30 other preferred embodiments, the kits further comprise reagents for detecting a nucleic 
acid cleavage product. In further preferred embodiments, the reagents for detecting a 
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cleavage product comprise oligonucleotides for use in a subsequent invasive cleavage 
reaction {See e.g., U.S. Patent No. 5,994,069). In particularly preferred embodiments, the 
reagents for the subsequent invasive cfeavage reaction comprise a probe labeled with 
moieties that produce a fluorescence resonance energy transfer (FRET) effect. 
5 The present invention also provides methods for treating nucleic acid, comprising: 

a) providing: a first structure-specific nuclease consisting of an endonuclease in a 
solution containing manganese; and a nucleic acid substrate; b) treating the nucleic acid 
substrate with increased temperature such that the substrate is substantially single- 
stranded; c) reducing the temperature xmder conditions such that the single-stranded 

10 substrate forms one or more cleavage structures; d) reacting the cleavage means with the 
cleavage structures so that one or more cleavage products are produced; and e) detecting 
the one or more cleavage products. In some embodiments of the methods, the 
• endonuclease includes, but is not limited to, CLEAVASE BN enzyme, Thermus 
aquaticus DNA polymerase, Thermus thermophilus DNA polymerase, Escherichia coli 

1 5 Bxo III, and the Saccharomyces cerevisiae Radl/RadlO complex. In yet other preferred 
embodiments, the nuclease is a 5' nuclease derived firom a thermostable DNA polymerase 
altered in amino acid sequence such that it exhibits reduced DNA synthetic activity from 
that of the wild-type DNA polymerase but retains substantially the same 5' nuclease 
activity of the wild-type DNA polymerase. In yet other embodiments, the nucleic acid is 

20 selected fix)m flie group consisting of RNA and DNA. In further embodiments, the 
nucleic acid of step (a) is double stranded. 

The present invention also provides nucleic acid treatment kits, comprising: a) a 
composition comprising at least one purified FEN-1 endonuclease; and b) a solution 
containing manganese. In some embodiments of the kits, the purified FEN-1 

25 endonuclease is selected from the group consisting Pyrococcus woesei FEN-1 

endonuclease, Pyrococcus furiosus FEN-1 endonuclease, Methanococcus jannaschii 
FEN-1 endonuclease, Methanobacterium thermoautotrophicum FEN-1 endonuclease, 
Archaeoglobus Julgidus FEN-1, Sulfolobus solfataricus, Pyrobaculum aerophilum, 
Thermococcus litoralis, Archaeaglobus veneficus, Archaeaglobus profundus, Acidianus 

30 brierlyi, Acidianus ambivalens, Desuljurococcus amylolyticus, Desulfurococcus mobilis, 
Pyrodictium brockii, Thermococcus gorgonarius, Thermococcus zilligii, Methanopyrus 
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kandlerU Methanococcus igneus, Pyrococcus horikoshiU Aeropyrum pemiXy and 
chimerical FEN-1 endonucleases. In other embodiments, the kits further comprise at 
least one second structure-specific nuclease. In some preferred embodiments, the second 
nuclease is a 5' nuclease derived from a thermostable DNA polymerase altered in amino 

5 acid sequence such that it exhibits reduced DNA synthetic activity from that of the wild- 
type DNA polymerase but retains substantially the same 5* nuclease activity of the wild- 
type DNA polymerase. In yet other embodiments of the kits, the portion of the amino 
acid sequence of the second nuclease is homologous to a portion of the amino acid 
sequence of a thermostable DNA polymerase derived from a eubacterial thermophile of 

1 0 the genus Thermus. In fijrther embodiments, the theraiophile is selected from the group 
consisting of Thermus aquaticus, Thermus flavus and Thermus thermophilus. In yet other 
preferred embodiments, the kits further comprise reagents for detecting the cleavage 
products. 

The present invention ftuther provides any of the compositions, mixtures, 
15 methods, and kits described herein, used in conjunction with endonucleases comprising 
Sulfolobus solfataricus, Pyrobaculum aerophilum, Thermococcus litoralis, 
Archaeaglobus veneficus, Archaeaglobus profundus, Acidianus brierlyU Acidianus 
ambivalens, Desuljurococcus amylolyticus, Desuljurococcus mobilis, Pyrodictium 
brockih Thermococcus gorgonarius, Thermococcus zilligii, Methanopyrus kandleri, 
20 Methanococcus igneus, Pyrococcus horikoshiU and Aeropyrum pemix endonucleases. 
These include compositions comprising purified FEN-1 endonucleases from the 
organisms (including specific endonucleases described by sequences provided herein, as 
well as, variants and homologues), kits comprising these compositions, composition 
comprising chimerical endonucleases comprising at least a portion of the endonucle^es 
.25 from these organisms, kits comprising such compositions, compositions comprising 
nucleic acids encoding the endonucleases from these organisms (including vectors and 
host cells), kits comprising such compositions, antibodies generated to the endonucleases, 
mixtures comprising endonucleases from these organisms, methods of using the 
endonuclease in cleavage assays (e.g., invasive cleavage assays, CFLP, etc.), and kits 
30 containing components useftil for such methods. Examples describing the generation, 
structure, use, and characterization of these endonucleases are provided herein. 
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The present invention also provides methods for improving the methods and 
enzymes disclosed herein. For example, the present invention provides methods of 
improving enzymes for any intended purpose (e.g., use in cleavage reactions, 
amplification reactions, binding reactions, or any other use) comprising the step of 
5 providing an enzyme disclosed herein and modifying the enzyme (e.g., altering the amino 
acid sequence, adding or subtracting sequence, adding post-translational modifications, 
adding any other component whether biological or not, or any other modification). 
Likewise, the present invention provides methods for improving the methods disclosed 
herein comprising, conducting the method steps with one or more changes (e.g., change 

10 in a composition provided in the method, change in the order of the steps, or addition or 
subtraction of steps). 

The improved performance in a detection assay may arise from any one of, or a 
combination of several improved features. For example, in one embodiment, the enzyme 
of the present invention may have an improved rate of cleavage (kcat) on a specific 

1 5 targeted structure, such that a larger amount of a cleavage product may be produced in a 
given time span. In another embodiment, the enzyme of the present invention may have 
a reduced activity or rate in the cleavage of inappropriate or non-specific structures. For 
example, in certain embodiments of the present invention, one aspect of improvement is 
that the differential between the detectable amount of cleavage of a specific structure and 

20 the detectable amount of cleavage of any alternative structures is increased. As such, it is 
within the scope of the present invention to provide an enzyme having a reduced rate of 
cleavage of a specific target structure compared to the rate of the native enzyme, and 
having a further reduced rate of cleavage of any alternative structures, such that the 
differential between the detectable amount of cleavage of the specific structure and the 

25 detectable amount of cleavage of any alternative structures is increased. However, the 
present invention is not limited to enzymes that have an improved differential. 

In some preferred embodiments, the present invention provides a composition 
comprising an enzyme, wherein the enzyme comprises a heterologous functional domain, 
wherein the heterologous functional domain provides altered (e.g., improved) 

30 functionality in a nucleic acid cleavage assay. The present invention is not limited by the 
nature of the nucleic acid cleavage assay. For example, nucleic acid cleavage assays 
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include any assay in which a nucleic acid is cleaved, directly or indirectly, in the presence 
of the enzyme. In certain preferred embodiments, the nucleic acid cleavage assay is an 
invasive cleavage assay. In particularly preferred embodiments, the cleavage assay 
utilizes a cleavage structure having at least one RNA component. In another particularly 
preferred embodiment, the cleavage assay utilizes a cleavage structure having at least one 
RNA component, wherein a DNA member of the cleavage stmcture is cleaved. 

The present invention is not limited by the nature of the altered functionality 
provided by the heterologous functional domain. Illustrative examples of alterations 
include, but are not limited to, enzymes where the heterologous functional domain 
conrprises an amino acid sequence (e.g., one or more amino acids) that provides an 
inq)roved nuclease activity, an improved substrate binding activity and/or improved 
background specificity in a nucleic acid cleavage assay. 

The present invention is not limited by the nature of the heterologous functional 
domain. For example, in some embodiments, the heterologous functional domain 
comprises two or more amino acids from a polymerase domain of a polymerase (e.g., 
introduced into the enzyme by insertion of a chimerical functional domain or created by 
mutation). In certain preferred embodiment, at least one of the two or more amino acids 
is from a pabn or thumb region of the polymerase domain. The present invention is not 
limited by the identity of the polymerase from which the two or more amino acids are 
selected. In certain preferred embodiments, the polymerase comprises Thermus 
thermophilus polymerase. In particularly preferred embodiments, the two or more amino 
acids are from amino acids 300-650 of SEQ ID NO: I . 

The novel enzymes of the invention may be employed for the detection of target 
DNAs and RNAs including, but not limited to, target DNAs and RNAs comprising wild 
type and! mutant alleles of genes, including, but not limited to, genes from humans, other 
animal, or plants that are or may be associated with disease or other conditions. In 
addition, the enzymes of the invention may be used for the detection of and/or 
identification of strains of microorganisms, including bacteria, fungi, protozoa, ciliates 
and viruses (and in particular for the detection and identification of viruses having RNA 
genomes, such as the Hepatitis C and Human Immunodeficiency viruses). For example, 
the present invention provides methods for cleaving a nucleic acid comprising providing: 
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an enzyme of the present invention and a substrate nucleic acid; and exposing the 
substrate nucleic acid to the enzyme (e.g., to produce a cleavage product that .may be 
detected). In some embodiments, the substrate nucleic is in a cell lysate sample. 

The present invention also provides a method for d^ecting the presence of a 
5 target nucleic acid comprising: cleaving an invasive cleavage structure, said invasive 
cleavage structure comprising an RNA target nucleic acid; and detecting the cleavage of 
the invasive cleavage structure. Such an assay may comprise a multiplex assay, wherein 
multiple invasive cleavage structures are cleaved. Such structures include structures 
fomied on different target nucleic acids, as well as, structures formed on different 

1 0 locations of the sample target nucleic acid. In some embodiments, the target nucleic acid 
comprises a first region and a second region, said second region downstream of and 
contiguous to said ISrst region. In some embodiments, the invasive cleavage structure 
comprises the target nucleic acid, a first oligonucleotide, and a second oligonucleotide, 
wherein at least a portion of the first oligonucleotide is completely compl^entary to the 

1 5 first portion of the first target nucleic acid, and wherein the second oligonucleotide 
comprises a 3' portion and a S' portion, wherein the 5' portion is completely 
complementary to said second portion of the target nucleic acid. In some embodiments, 
the 3* portion of the second oligonucleotide comprises a 3* terminal nucleotide not 
complementary to said target nucleic acid. In some embodiments, the 3* portion of the 

20 second oligonucleotide consists of a single nucleotide not complementaiy to the target 
nucleic acid. In some embodiments, the method further comprises the steps of forming a 
second invasive cleavage structure comprising a non-target cleavage product and 
cleaving the second invasive cleavage structure. In some embodiments, the invasive 
cleavage structure or the second invasive cleavage comprises an oligonucleotide 

25 comprising a sequence selected from the group consisting of SEQ ID NO:709-2640. In 
other embodiments, the invasive cleavage structure or the second invasive cleavage 
comprises an oligonucleotide comprising a sequence selected from the group consisting 
of SEQ ID NO: 169-2 11 and 619-706. In some preferred embodimrats, the target nucleic 
acid comprises a cytochrome P450 RNA or a cytokine RNA. In some embodiments, the 

30 first region or the second region of the target nucleic acid encompasses a splice junction, 
an exon (or a portion thereof), or an intron (or a portion thereof). In some embodiments, 
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the RNA target nucleic acid is provided in a cell lysate. In some embodiments, the first 
oligonucleotide is covalently attached to the second oligonucleotide. Such 
oligonucleotides find use, for example, in methods described in U.S. Patent Nos. 
5,714,320 and 5,854,033, herein incorporated by reference in their entireties. The present 
5 invention also provides kits containing one or more of the components used in the above 
methods. 

DEFINITIONS 

To facilitate an understanding of the present invention, a number of terms and 

1 0 phrases are defined below: 

As used herein, the terms "complementary" or "complementarity" are used in 
reference to polynucleotides (Le,, a sequence of nucleotides such as an oligonucleotide or 
a target nucleic acid) related by the base-pairing rules. For example, for the sequence " 
5LA-G-T-3*," is complementary to the sequence " 3»-T-C-A-5'." Complementarity may be 

1 5 "partial," in which only some of the nucleic acids' bases are matched according to the 
base pauing rules. Or, there may be "complete" or "total" complementarity between the 
nucleic acids. The degree of complementarity between nucleic acid strands has 
significant effects on the efiBciency and strength of hybridization between nucleic acid 
strands. This is of particular importance in amplification reactions, as well as detection 

20 methods that depend upon binding between nucleic acids. Either term may also be used 
in reference to individual nucleotides, especially within the context of polynucleotides. 
For example, a particular nucleotide within an oligonucleotide may be noted for its 
complementarity, or lack thereot to a nucleotide widiin another nucleic acid strand, in 
contrast or comparison to the complementarity between the rest of the oligonucleotide 

25 and the nucleic acid strand. Nucleotide analogs used to form non-standard base pairs, 
whether with another nucleotide analog (e.g., an IsoC/IsoG base pair), or with a naturally 
occurring nucleotide (e.g., as described in U.S. Patent 5,912,340, herein incorporated by 
reference in its entirety) are also considered to be complementary to a base pairing 
partner within the meaning this definition. 
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The term "homology" and "homologous" refers to a degree of identity. There 
may be partial homology or complete homology. A partially homologous sequence is 
one that is less than 100% identical to another sequence. 

As used herein, the term "hybridization" is used in referrace to the pairing of 
S complementary nucleic acids. Hybridization and the strength of hybridization {Le,^ the 
strength of the association between the nucleic acids) is influenced by such factors as the 
degree of complementary between the nucleic acids, stringency of the conditions 
involved, and the T^ of the formed hybrid. "Hybridization" methods involve the 
aimealing of one nucleic acid to another, complementary nucleic acid, <.e., a nucleic acid 

1 0 having a complementary nucleotide sequence. The ability of two polymers of nucleic 
acid containing complementary sequences to find each other and anneal through base 
pairing interaction is a well-recognized phenomenon. The initial observations of the 
"hybridization" process by Marmur and Lane, Proc. Nati. Acad. Sci. USA 46:453 (1960) 
and Doty et al., Proc. Natl. Acad. Sci. USA 46:461 (1960) have been followed by the 

1 5 refinement of this process into an essential tool of modem biology. 

With regard to complementarity, it is important for some diagnostic applications 
to determine whether the hybridization represents complete or partial complementarity. 
For example, where it is desired to detect simply the presence or absence of pathogen 
DNA (such as &om a virus, bacterium, fimgi, mycoplasma, protozoan) it is only 

20 important that the hybridization method ensures hybridization when the relevant 
sequence is present; conditions can be selected where both partially complementary 
probes and completely complementary probes will hybridize. Other diagnostic 
applications, however, may require that the hybridization method distinguish between 
partial and complete complementarity. It may be of interest to detect genetic 

25 polymorphisms. For example, human hemoglobin is composed, in part, of four 

polypeptide chains. Two of these chains are identical chains of 141 amino acids (alpha 
chains) and two of these chains are identical chains of 146 amino acids (beta chains). 
The gene encoding the beta chain is known to exhibit polymorphism. The normal allele 
encodes a beta chain having glutamic acid at the sixth position. The mutant allele 

30 encodes a beta chain having valine at the sixth position. This difference in amino acids 
has a profound (most profound when the individual is homozygous for the mutant allele) 
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physiological impact known clinically as sickle cell anemia. It is well known that the 
genetic basis of the amino acid change involves a single base difference between the 
normal allele DNA sequence and the mutant allele DNA sequence. 

The complement of a nucleic acid sequence as used herein refers to an 
oligonucleotide which, when aligned with the nucleic acid sequence such that the 5' end 
of one sequence is paired with the 3' end of the other, is in "antiparallel association." 
Certain bases not commonly found in natural nucleic acids may be included in the nucleic 
acids of the present invention and include, for example, inosine and 7-deazaguanine. 
Complementarity need not be perfect; stable duplexes may contain mismatched base 
pairs or unmatched bases. Those skilled in the art of nucleic acid technology can 
detennine duplex stability empirically considering a number of variables including, for 
example, the length of the oligonucleotide, base composition and sequence of the 
oligonucleotide, ionic strength and incidence of mismatched base pairs. 

As used herein, the term " T™ " is used in reference to the "melting temperature." 
The melting temperature is the temperature at which a population of double-stranded 
nucleic acid molecules becomes half dissociated into single strands. Several equations 
for calculating the of nucleic acids are well known in the art. As indicated by 
standard references, a simple estimate of the T„, value may be calculated by the equation: 
Tm = 81 ,5 + 0.4l(% G + C), when a nucleic acid is in aqueous solution at 1 M NaCl (see 
e.g., Anderson and Young, Quantitative Filter Hybridization, in Nucleic Acid 
Hybridization (1985). Other references {e.g.. Allawi, H.T. & SantaLucia, J., Jr. 
Thermodynamics and NMR of internal G.T mismatches in DNA. Biochemistry 36, 
10581-94 (1997) include more sophisticated computations which take structural and 
environmental, as well as sequence characteristics into account for the calculation of T^. 

As used herein the term "stringency" is used in reference to the conditions of 
temperature, ionic strength, and the presence of other compounds, under which nucleic 
acid hybridizations are conducted. With "high stringency" conditions, nucleic acid base 
pairing will occur only between nucleic acid fragments that have a high frequency of 
complementary base sequences. Thus, conditions of "weak" or "low" stringency are 
often required when it is desired that nucleic acids that are not completely 
complementary to one another be hybridized or annealed together. 
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"High stringency conditions" when used in reference to nucleic acid hybridization 
comprise conditions equivalent to binding or hybridization at 42 C in a solution 
consisting of 5X SSPE (43.8 g/I NaCl, 6.9 g/1 NaH2P04 H2O and 1.85 g/1 EDTA, pH 
adjusted to 7.4 with NaOH), 0.5% SDS, 5X Denhardt's reagent and 100 jig/ml denatured 
5 salmon sperm DNA followed by washing in a solution comprising O.IX SSPE, 1.0% SDS 
at 42 C when a probe of about 500 nucleotides in length is employed. 

"Medium stringency conditions" when used in reference to nucleic acid 
hybridization comprise conditions equivalent to binding or hybridization at 42 C in a 
solution consisting of 5X SSPE (43,8 g/1 NaCl, 6.9 gA NaH2P04 H2O and 1 .85 g/1 

10 EDTA, pH adjusted to 7.4 with NaOH), 0.5% SDS, 5X Denhardt's reagent and 100 fig/ml 
denatured saknon sperm DNA followed by washing in a solution comprising l.OX SSPE, 
1.0% SDS at 42 C when a probe of about 500 nucleotides in length is employed. 

"Low stringency conditions" comprise conditions equivalent to binding or 
hybridization at 42 C m a solution consisting of 5X SSPE (43.8 g/1 NaCl, 6.9 g/1 

15 NaH2P04 H2O and 1.85 g/1 EDTA, pH adjusted to 7.4 with NaOH), 0.1% SDS, 5X 
Denhardrs reagent [SOX Denhardt's contains per 500 ml: 5 g Ficoll (Type 400, 
Pharamcia), 5 g BSA (Fraction V; Sigma)] and 100 g/ml denatured salmon sperm DNA 
followed by washing in a solution comprising 5X SSPE, 0.1% SDS at 42 C when a probe 
of about 500 nucleotides in length is employed. 

20 The term "gene" refers to a DNA sequence that comprises control and coding 

sequences necessary for the production of an RNA having a non-coding function (e.g., a 
ribosomal or transfer RNA), a polypeptide or a precursor. The RNA or polypeptide can 
be encoded by a fiilHength coding sequence or by any portion of the coding sequence so 
long as the desired activity or function is retained. 

25 The term "wild-type" refers to a gene or a gene product that has the characteristics 

of that gene or gene product when isolated from a naturally occurring source. A wild- 
type gene is that which is most frequently observed in a population and is thus arbitrarily 
designated the "normal" or "wild-type" form of the gene. In contrast, the term 
"modified,""mutant," or "polymorphic" refers to a gene or gene product that displays 

30 modifications in sequence and or functional properties {Le., altered characteristics) when 
compared to the wild-type gene or gene product. It is noted that naturally-occurring 
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mutants can be isolated; these are identified by the fact that they have altered 
characteristics when compared to the wild-type gene or gene product. 

The term "recombinant DNA vector" as used herein refers to DNA sequences 
containing a desired heterologous sequence. For example, although the term is not 

5 limited to the use of expressed sequences or sequences that encode an expression product, 
in some embodiments, the heterologous sequence is a coding sequence and appropriate 
DNA sequences necessary for either the replication of the coding sequence in a host 
organism, or the expression of the operably linked coding sequence in a particular host 
organism. DNA sequences necessary for expression in prokaryotes include a promoter, 

1 0 optionally an operator sequence, a ribosome-binding site and possibly other sequences. 
Eukaryotic cells are known to utilize promoters, polyadenlyation signals and enhancers. 

The term "LTR" as used herein refers to the long terminal repeat found at each 
end of a provirus the integrated form of a retrovirus). The LTR contains numerous 
regulatory signals including transcriptional control elements, polyadenylation signals and 

1 5 sequences needed for replication and integration of the viral genome. The viral LTR is 
divided into three regions called U3, R and U5. 

The U3 region contains the enhancer and promoter elements. The U5 region 
contains the polyadenylation signals. The R (repeat) region separates the U3 and U5 
regions and transcribed sequences of the R region appear at both the 5' and 3' ends of the 

20 viral RNA. 

The term "oligonucleotide" as used herein is defined as a molecule comprising 
two or more deoxyribonucleotides or ribonucleotides, preferably at least 5 nucleotides, 
more preferably at least about 10-15 nucleotides and more preferably at least about 1 5 to 
30 nucleotides. The exact size will depend on many factors, which in turn depend on the 

25 ultimate function or use of the oligonucleotide. The oligonucleotide may be generated in 
any maimer, including chemical synthesis, DNA replication, reverse transcription, PGR, 
or a combination thereof. 

Because mononucleotides are reacted to make oligonucleotides in a manner such 
that the 5* phosphate of one mononucleotide pentose ring is attached to the 3' oxygen of 

30 its neighbor in one direction via a phosphodiester linkage, an end of an oligonucleotide is 
referred to as the "5* end" if its 5' phosphate is not linked to the 3' oxygen of a 
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mononucleotide pentose ring and as the "3* end" if its 3' oxygen is not linked to a 5' 
phosphate of a subsequent mononucleotide pentose ring. As used herein, a nucleic acid 
sequence, even if internal to a larger oligonucleotide, also may be said to have 5* and 3* 
ends. A first region along a nucleic acid strand is said to be upstream of another region if 
5 the 3* end of the first region is before the 5* end of the second region when moving along 
a strand of nucleic acid in a S' to 3* direction. 

When two different, non-overiapping oligonucleotides anneal to different regions 
of the same linear complementary nucleic acid sequence, and the 3' end of one 
oligonucleotide points towards the 5' end of the other, the former may be called the 

10 "upstream" oligonucleotide and the latter tiie "downstream" oligonucleotide. Similarly, 
when two overlapping oligonucleotides are hybridized to the same linear complementary 
nucleic acid sequence, with the first oligonucleotide positioned such that its 5* end is 
upstream of the 5* end of the second oligonucleotide, and the 3' end of the first 
oligonucleotide is upstream of the 3' end of the second oligonucleotide, the first 

1 5 oligonucleotide may be called the "upstream" oligonucleotide and the second 
oligonucleotide may be called the "downstream" oligonucleotide. 

The term "primer" refers to an oligonucleotide that is capable of acting as a point 
of initiation of synthesis when placed under conditions in which primer extension is 
initiated. An oligonucleotide "primer" may occur naturally, as in a purified restriction 

20 digest or may be produced synthetically. 

A primer is selected to be "substantially" complementary to a strand of specific 
sequence of the template. A primer must be sufficiently complementary to hybridize 
with a template strand for primer elongation to occur. A primer sequence need not reflect 
the exact sequence of the template. For example, a non-complementary nucleotide 

25 fi-agment may be attached to the 5' end of the primer, with the remainder of the primer 
sequence being substantially complementary to the strand. Non-complementary bases or 
longer sequences can be interspersed into the primer, provided that the primer sequence 
has sufficient complementarity with the sequence of the template to hybridize and 
thereby form a template primer complex for synthesis of the extension product of the 

30 primer. 
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The temi "label" as used herein refers to any atom or molecule that can be used to 
provide a detectable (preferably quantifiable) effect, and that can be attached to a nucleic 
acid or protein. Labels include but are not limited to dyes; radiolabels such as ^^P; 
binding moieties such as biotin; haptens such as digoxgenin; luminogenic, 
phosphorescent or fluorogenic moieties; and fluorescent dyes alone or in combination 
with moieties that can suppress or shift emission spectra by fluorescence resonance 
energy transfer (FRET). Labels may provide signals detectable by fluorescence, 
radioactivity, colorimetry, gravimetry, X-ray diffraction or absorption, magnetism, 
enzymatic activity, and the like. A label may be a charged moiety (positive or negative 
charge) or alternatively, may be charge neutral. Labels can include or consist of nucleic 
acid or protein sequence, so long as the sequence comprising the label is detectable. 

The term "signal" as used herein refers to any detectable effect, such as would be 
caused or provided by a label or an assay reaction. 

As used herein, the term "detector" refers to a system or component of a system, 
e.g., an instrument (e.g. a camera, fluorimeter, charge-coupled device, scintillation 
counter, etc.) or a reactive medium (X-ray or camera film, pH indicator, etc.), that can 
convey to a user or to another component of a system (e.g., a computer or controller) the 
presence of a signal or effect. A detector can be a photometric or spectrophotometric 
system, which can detect ultraviolet, visible or infrared light, including fluorescence or 
chemiluminescence; a radiation detection system; a spectroscopic system such as nuclear 
magnetic resonance spectroscopy, mass spectrometry or surface enhanced Raman 
spectrometry; a system such as gel or capillary electrophoresis or gel exclusion 
chromatography; or other detection systems known in the art, or combinations thereof. 

The term "cleavage structure" as used herein, refers to a structure that is formed 
by the interaction of at least one probe oligonucleotide and a target nucleic acid, forming 
a structure comprising a duplex, the resulting structure being cleavable by a cleavage 
agent, including but not limited to an enzyme. The cleavage structure is a substrate for 
specific cleavage by the cleavage means in contrast to a nucleic acid molecule that is a 
substrate for non-specific cleavage by agents such as phosphodiesterases that cleave 
nucleic acid molecules without regard to secondary structure (i.e., no formation of a 
duplexed structure is required). 
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The term "folded cleavage structure" as used herein, refers to a region of a single- 
stranded nucleic acid substrate containing secondary structure, the region being cleavable 
by an enzymatic cleavage means. The cleavage structure is a substrate for specific 
cleavage by the cleavage means in contrast to a nucleic acid molecule that is a substrate 
5 for non-specific cleavage by agents such as phosphodiesterases that cleave nucleic acid 
molecules without regard to secondary structure (z.e., no folding of the substrate is 
required). 

As used herein, the term "folded target" refers to a nucleic acid strand that 

contains at least one region of secondary structure (/.e, at least one double stranded 
10 region and at least one single-stranded region within a single strand of the nucleic acid). 

A folded target may comprise regions of tertiary structure in addition to regions of 

secondary structure. 

The term "cleavage means" or "cleavage agent" as used herein refers to any 

means that is capable of cleaving a cleavage structure, including but not limited to 
15 enaymes. The cleavage means may include native DNAPs having 5' nuclease activity 

(e,g.y Taq DNA polymerase, E. coli DNA polymerase I) and, more specifically, modified 

DNAPs having 5* nuclease but lacking synthetic activity. "Structure-specific nucleases" 

or "structure-specific enzymes" are enzymes that recognize specific secondary structures 

in a nucleic acid molecule and cleave these structures. The cleavage means of the 
20 invention cleave a nucleic acid molecule in response to the formation of cleavage 

structures; it is not necessary that the cleavage means cleave the cleavage structure at any 

particular location within the cleavage structure. 

The cleavage means is not restricted to enzymes having solely 5* nuclease 

activity. The cleavage means may include nuclease activity provided fi-om a variety of 
25 sources including the CLEAVASE enzymes, the FEN-1 endonucleases (including RAD2 

and XPG proteins), Taq DNA polymerase and E. coli DNA polymerase I. 

The term "thermostable" when used in reference to an enzyme, such as a 5* 

nuclease, indicates that the enzyme is functional or active (i.e., can perform catalysis) at 

an elevated temperature, i.e., at about 55''C or higher. 



42 



wo 01/90337 



PCT/USOl/17086 



The term "cleavage products" as used herein, refers to products generated by the 
reaction of a cleavage means with a cleavage structure (i.e., the treatment of a cleavage 
structure with a cleavage means). 

The term "target nucleic acid" refers to a nucleic acid molecule containing a 
sequence that has at least partial complementarity with at least a probe oligonucleotide 
and may also have at least partial complementarity with an INVADER oligonucleotide. 
The target nucleic acid may comprise single- or double-stranded DNA or RNA, and may 
comprise nucleotide analogs, labels, and other modifications. 

The term "probe oligonucleotide" refers to an oligonucleotide that interacts with a 
target nucleic acid to form a cleavage structure in the presence or absence of an 
INVADER oligonucleotide. When annealed to the target nucleic acid, the probe 
oligonucleotide and target form a cleavage structure and cleavage occurs within the probe 
oligonucleotide. 

The term "non-target cleavage product" refers to a product of a cleavage reaction 
that is not derived from the target nucleic acid. As discussed above, in the methods of the 
present invention, cleavage of the cleavage structure generally occurs within the probe 
oligonucleotide. The fragments of the probe oligonucleotide generated by this target 
nucleic acid-dependent cleavage are "non-target cleavage products." 

The term "INVADER oligonucleotide" refers to an oligonucleotide that 
hybridizes to a target nucleic acid at a location near the region of hybridization between a 
probe and tiie target nucleic acid, wherein the INVADER oligonucleotide comprises a 
portion (eg., a chemical moiety, or nucleotide— whether complementary to that target or 
not) that overlaps with the region of hybridization between the probe and target. In some 
embodiments, the INVADER oligonucleotide contains sequences at its 3' end that are 
substantially the same as sequences located at the 5* end of a probe oligonucleotide. 

The term "substantially single-stranded" when used in reference to a nucleic acid 
substrate means that the substrate molecule exists primarily as a single strand of nucleic 
acid in contrast to a double-stranded substrate which exists as two strands of nucleic acid 
which are held together by inter-strand base pairing interactions. 

The term "sequence variation" as used herein refers to differences in nucleic acid 
sequence between two nucleic acids. For example, a wild-type structural gene and a 
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mutant form of this wild-type structural gene may vary in sequence by the presence of 
single base substitutions and/or deletions or insertions of one or more nucleotides. These 
two fomis of the structural gene are said to vary in sequence from one another. A second 
mutant form of the structural gene may exist. This s«)ond mutant form is said to vary in 
5 sequence from both the wild-type gene and the first mutant form of the gene. 

The term "liberating" as used herein refers to the release of a nucleic acid 
fragment from a larger nucleic acid fragment, such as an oligonucleotide, by the action 
of, for example, a 5* nuclease such that the released fragment is no longer covalently 
attached to the remainder of the oligonucleotide. 

10 The term "Kn," as used herein refers to the Michaelis-Menten constant for an 

enzyme and is defined as the concentration of the specific substrate at which a given 
enzyme yields one-half its maximum velocity in an enzyme catalyzed reaction. 

The term "nucleotide" as used herein includes, but is not limited to, naturally 
occurring and/or synthetic nucleotides, nucleotide analogs, and nucleotide derivatives. 

15 For example, the term includes naturally occurring DNA or RNA monomers, nucleotides 
with backbone modifications such as peptide nucleic acid (PNA) (M. Eghohn et al.. 
Nature 365:566 [1993]), phosphorothioate DNA, phosphorodithioate DNA, 
phosphoramidate DNA, aminde-linked DNA, MMI-linked DNA, 2'-0-niethyl RNA, 
alpha-DNA and methylphosphonate DNA, nucleotides with sugar modifications such as 

20 2*-0-methyl RNA, 2'.fluoro RNA, 2'-amino RNA, 2'-0-alkyl DNA, 2'-0-allyl DNA, 
2'-0-alkynyl DNA, hexose DNA, pyranosyl RNA, and anhydrohexitol DNA, and 
nucleotides having base modifications such as C*5 substituted pyrimidines (substituents 
including fluoro-, bromo- chloro-, iodo-, methyl-, ethyl-, vinyl-, formyl-, ethynyl-, 
propynyl-, alkynyl-, thiazoyl-, imidazolyl-, pyridyl-), 7-deazapurines with C-7 

25 substituents including fluoro-, bromo-, chloro-, iodo-, methyl-, ethyl-, vinyl-, formyl-, 
alkynyl-, alkenyl-, thiazolyl-, imidazolyl-, pyridyl-), inosine and diaminopurine. 

The term "base analog" as used herein refers to modified or non-naturally 
occurring bases such as 7-deaza purines 7-deaza-adenine and 7-deaza-guanine); 
bases modified, for example, to provide altered interactions such as non-standard 

30 basepairing, including, but not limited to: IsoC, Iso G, and other modified bases and 
nucleotides described in U.S. Patent Nos. 5,432,272; 6,001,983; 6,037,120; 6,140,496; 



44 



wo 01/90337 



PCT/USOl/17086 



5,912,340; 6,127,121 and 6,143.877, each of which is incorporated herein by reference in 
their entireties; heterocyclic base analogs based no the purine or pyrimidine ring systems, 
and other heterocyclic bases. Nucleotide analogs include base analogs and comprise 
modified forms of deoxyribonucleotides as well as ribonucleotides. 

The tenn "polymorphic locus" is a locus present in a population that shows 
variation between members of the population the most common allele has a 
frequency of less than 0.95). In contrast, a "monomorphic locus" is a genetic locus at 
little or no variations seen between members of the population (generally taken to be a 
locus at which the most common allele exceeds a frequency of 0.95 in the gene pool of 
the population). 

The term "microorganism" as used herein means an organism too small to be 
obsCTved with the unaided eye and includes, but is not limited to bacteria, virus, 
protozoans, fimgi, and ciliates. 

The term "microbial gene sequences" refers to gene sequences derived from a 

microorganism. 

The term "bacteria" refers to any bacterial species including eubacterial and 
aichaebacterial species. 

The term "virus" refers to obligate, ultramicroscopic, intracellular parasites 
incapable of autonomous replication (i.e., replication requires the use of the host celPs 
machinery). 

The term "multi-drug resistant" or multiple-drug resistant" refers to a 
microorganism that is resistant to more than one of the antibiotics or antimicrobial agents 
used in the treatmait of said microorganism. 

The term "sample" in the present specification and claims is used in its broadest 
sense. On the one hand it is meant to include a specimen or culture (e.g., microbiological 
cultures). On the other hand, it is meant to include both biological and environmental 
samples. A sample may include a specimen of synthetic origin. 

Biological samples may be animal, including human, fluid, solid {e.g., stool) or 
tissue, as well as liquid and solid food and feed products and ingredients such as dairy 
items, vegetables, meat and meat by-products, and waste. Biological samples may be 
obtained from all of the various families of domestic animals, as well as feral or wild 
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animals, including, but not limited to, such animals as ungulates, bear, fish, lagamorphs, 
rodents, etc. 

Environmental samples include environmental material such as surface matter, 
soil, water and industrial samples, as well as samples obtained from food and dairy 
5 processing instruments, apparatus, equipment, utensils, disposable and non-disposable 
items. These examples are not to be construed as limiting the sample types applicable to 
the present invention. 

The term "source of target nucleic acid" refers to any sample that contains nucleic 
acids (RNA or DNA). Particularly preferred sources of target nucleic acids are biological 
10 samples including, but not limited to blood, saliva, cerebral spinal fluid, pleural fluid, 
milk, lymph, sputum and semen. 

An oligonucleotide is said to be present in "excess" relative to another 
oligonucleotide (or target nucleic acid sequence) if that oligonucleotide is present at a 
higher molar concentration that the other oligonucleotide (or target nucleic acid 
1 5 sequence). When an oligonucleotide such as a probe oligonucleotide is present in a 
cleavage reaction in excess relative to the concentration of the complementary target 
nucleic acid sequence, the reaction may be used to indicate the amount of the target 
nucleic acid present. Typically, when present in excess, the probe oligonucleotide will be 
present at least a 100-fold molar excess; typically at least 1 pmole of each probe 
20 oligonucleotide would be used when the target nucleic acid sequence was present at 
about 10 fmoles or less. 

A sample "suspected of containing" a first and a second target nucleic acid may 
contain either, both or neither target nucleic acid molecule. 

The terai "charge-balanced" oligonucleotide refers to an oligonucleotide (the 
25 input oligonucleotide in a reaction) that has been modified such that the modified 
oligonucleotide bears a charge, such that when the modified oligonucleotide is either 
cleaved (i.e., shortened) or elongated, a resulting product bears a charge different from 
the input oligonucleotide (the "charge-unbalanced" oligonucleotide) thereby permitting 
separation of the input and reacted oligonucleotides on the basis of charge. The term 
30 "charge-balanced" does not imply that the modified or balanced oligonucleotide has a net 
neutral charge (although this can be the case). Charge-balancing refers to the design and 
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modification of an oligonucleotide such that a specific reaction product generated from 
this input oligonucleotide can be separated on the basis of charge from the input 
oligonucleotide. 

For example, in an INVADER oligonucleotide-directed cleavage assay in which 

5 the probe oligonucleotide bears the sequence: 5' TTCTTTTCACCAGCGAGACGGG 3* 
(Le.y SEQ ID NO:136 without the modified bases) and cleavage of the probe occurs 
between the second and third residues, one possible charge-balanced version of this 
oligonucleotide would be: 5' Cy3-AminoT-Amino-TCTTTTCACCAGCGAGAC GGG 
3\ This modified oligonucleotide bears a net negative charge. After cleavage, the 

10 following oligonucleotides are generated: 5' Cy3-AminoT-Amino-T 3'and 5' 
CTTTTCACCAGCGAGACGGG 3* (residues 3-22of SEQ ID NO: 136), 5» Cy3- 
AminoT-Amino-T 3' bears a detectable moiety (the positively-charged Cy3 dye) and two 
amino-modified bases. The amino-modified bases and the Cy3 dye contribute positive 
charges in excess of the negative charges contributed by the phosphate groups and thus 

1 5 the 5' Cy3- AminoT-Amino-T 3*oligonucleotide has a net positive charge. The other, 
longer cleavage fragment, like the input probe, bears a net negative charge. Because the 
5' Cy3-AminoT-Amino-T 3'fragment is separable on the basis of charge from the input 
probe (the charge-balanced oligonucleotide), it is referred to as a charge-unbalanced 
oligonucleotide. The longer cleavage product cannot be separated on the basis of charge 

20 from the input oligonucleotide as both oligonucleotides bear a net negative charge; thus, 
the longer cleavage product is not a charge-unbalanced oligonucleotide. 

The term "net neutral charge" when used in reference to an oligonucleotide, 
including modified oligonucleotides, indicates that the sum of the charges present (i.e., R- 
NH3+ groups on thymidines, the N3 nitrogen of cytosine, presence or absence or 

25 phosphate groups, etc.) under the desired reaction or separation conditions is essentially 
zero. An oligonucleotide having a net neutral charge would not migrate in an electrical 
field. 

The term "net positive charge" when used in reference to an oligonucleotide, 
including modified oligonucleotides, indicates that the sum of the charges present (i.e., R- 
30 NH3+ groups on thymidines, the N3 nitrogen of cytosine, presence or absence or 
phosphate groups, etc.) under the desired reaction conditions is +1 or greater. An 
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oligonucleotide having a net positive charge would migrate toward the negative electrode 
in an electrical field. 

The term "net negative charge" when used in reference to an oligonucleotide, 
including modified oligonucleotides, indicates that the sum of the charges present (i.e., R- 
5 NH3+ groups on thymidines, the N3 nitrogen of cytosine, presence or absence or 
phosphate groups, etc.) under the desired reaction conditions is -1 or lower. An 
oligonucleotide having a net negative charge would migrate toward the positive electrode 
in an electrical field. 

The term "polymerization means" or ^'polymerization agent" refers to any agent 
10 enable of facilitating the addition of nucleoside triphosphates to an oligonucleotide. 
Preferred polymerization means comprise DNA and RNA polymerases. 

The term "ligation means" or "ligation agent" refers to any agent capable of 
facilitating the ligation (i.e., the formation of a phosphodiester bond between a 3 -OH and 
a 5' P located at the termini of two strands of nucleic acid). Preferred ligation means 
15 comprise DNA ligases and RNA ligases. 

The term "reactant" is used herein in its broadest sense. The reactant can 
comprise, for example, an enzymatic reactant, a chemical reactant or light (e.g., 
ultraviolet light, particularly short wavelength ultraviolet light is known to break 
oligonucleotide chains). Any agent capable of reacting with an oligonucleotide to either 
20 shorten (i.e., cleave) or elongate the oligonucleotide is encompassed within the term 
"reactant," 

The term "adduct" is used herein in its broadest sense to indicate any compound 
or element that can be added to an oligonucleotide. An adduct may be charged 
(positively or negatively) or may be charge-neutral. An adduct may be added to the 

25 oligonucleotide via covalent or non-covalent linkages. Examples of adducts include, but 
are not limited to, indodicarbocyanine dye amidites, amino-substituted nucleotides, 
ethidium bromide, ethidium homodimer, (l,3-propanediamino)propidium, 
(diethylenetriamino)propidium, thiazole orange, (N-N'-tetramethyl-l,3- 
propanediamino)propyl thiazole orange, (N-N'-tetramethyl-l,2-ethanediamino)propyl 

30 thiazole orange, thiazole orange-thiazole orange homodimer (TOTO), thiazole orange- 
thiazole blue heterodimer (TOTAB), thiazole orange-ethidium heterodimer 1 (TOEDl), 
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thiazole orange-ethidium hetcrodimer 2 (T0ED2) and fluorescein-ethidium heterodimer 
(FED), psoralens, biolin, slreptavidin. avidin, clc. 

Where a first oligonucleotide is complementary to a region of a target nucleic acid 
and a second oligonucleotide has complementary to the same region (or a portion of this 
region) a "region of sequence overlap" exists along the target nucleic acid. The degree of 
overlap will vary depending upon the nature of the complementarity (see, e.g., region "X" 
in Figs. 29 and 67 and the accompanying discussions). 

As used herein, the term "purified" or "to purify" refers to the removal of 
contaminants from a sample. For example, recombinant CLEAVASE nucleases are 
expressed in bacterial host cells and the nucleases are purified by the removal of host cell 
proteins; the percent of these recombinant nucleases is thereby increased in the sample. 

The term "recombinant DNA molecule" as used herein refers to a DNA molecule 
that comprises of segments of DNA joined together by means of molecular biological 
techniques. 

The term "recombinant protein" or "recombinant polypeptide" as used herein 
refers to a protein molecule that is expressed firom a recombinant DNA molecule. 

As used herein the term "portion" when in reference to a protein (as in "a portion 
of a given protein") refers to fragments of that protein. The firagments may range in size 
from four amino acid residues to the entire amino acid sequence minus one amino acid 

(e.g.,4,5,6 n-1). 

The term "nucleic acid sequence" as used herein refers to an oligonucleotide, 
nucleotide or polynucleotide, and fiagments or portions thereof, and to DNA or RNA of 
genomic or synthetic origin that may be single or double stranded, and represent the 
sense or antisense strand. Similarly, "amino acid sequence" as used herein refers to 
peptide or protein sequence. 

The term "peptide nucleic acid" ("PNA") as used herein refers to a molecule 
comprising bases or base analogs such as would be found in natural nucleic acid, but 
attached to a peptide backbone rather than the sugar-phosphate backbone typical of 
nucleic acids. The attachment of the bases to the peptide is such as to allow the bases to 
base pair with complementary bases of nucleic acid m a manner similar to that of an 
oligonucleotide. These small molecules, also designated anti gene agents, stop transcript 
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elongation by binding to their complementary strand of nucleic acid (Nielsen, et al 
Anticancer Drug Des. 8:53 63 [1993]). 

As used herein, the terms "purified" or "substantially purified" refer to molecules, 
either nucleic or amino acid sequences, that are removed from their natural environment, 
5 isolated or separated, and are at least 60% free, preferably 75% free, and most preferably 
90% fi-ee from other components with which they are naturally associated. An "isolated 
polynucleotide" or "isolated oligonucleotide" is therefore a substantially purified 
polynucleotide. 

As used herein, the term "fusion protein" refers to a chimeric protein 

10 containing the protein of interest (e.g., CLEAVASE BN/thrombin nuclease and portions 
or fragments thereof) joined to an exogenous protein fragment (the fusion partner which 
consists of a non CLEAVASE BN/thrombm nuclease protein). The fusion partner may 
enhance solubility of recombinant chimeric protein (e.g., the CLEAVASE BN/thrombin 
nuclease) as expressed in a host cell, may provide an affinity tag (e.g., a his-tag) to allow 

15 purification of the recombinant fusion protein from the host cell or culture supernatant, or 
both. If desired, the fusion protein may be removed from the protein of interest (e.g., 
CLEAVASE BN/thrombin nuclease or fragments thereof) by a variety of enzymatic or 
chemical means known to the art. 

As used herein, the terms "chimeric protein" and "chimerical protein" refer to a 

20 single protein molecule that comprises amino acid sequences portions derived from two 
or more parent proteins. These parent molecules may be from similar proteins from 
genetically distinct origins, different proteins from a single organism, or different 
proteins from different organisms. By way of example but not by way of limitation, a 
chimeric structure-specific nuclease of the present invention may contain a mixture of 

25 amino acid sequences that have been derived from FEN-1 genes from two or more of the 
organisms having such genes, combined to form a non-naturally occurring nuclease. The 
term "chimerical" as used herein is not intended to convey any particular proportion of 
contribution from the naturally occurring genes, nor limit the manner in which the 
portions are combined. Any chimeric structure-specific nuclease constructs having 

30 cleavage activity as determined by the testing methods described h^ein are improved 
cleavage agents within the scope of the present invention. 
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The term "continuous strand of nucleic acid" as used herein is means a strand of 
nucleic acid that has a continuous, covalently linked, backbone structure, without nicks or 
other disruptions. The disposition of the base portion of each nucleotide, whether 
base-paired, single-stranded or mismatched, is not an element in the definition of a 
continuous strand. The backbone of the continuous strand is not limited to the 
ribose-phosphate or deoxyribose-phosphate compositions that are found in naturally 
occurring, unmodified nucleic acids. A nucleic acid of the present invention may 
comprise modifications in the structure of the backbone, including but not limited to 
phosphorothioate residues, phosphonate residues, 2' substituted ribose residues (e.g., 
2'-0-methyl ribose) and alternative sugar (e.g., arabinose) containing residues. 

The term "continuous duplex" as used herein refers to a region of double stranded 
nucleic acid in which there is no disruption in the progression of basepairs within the 
duplex (/.e., the base pairs along the duplex are not distorted to acconunodate a gap, 
bulge or mismatch with the confines of the region of continuous duplex). As used herein 
the term refers only to the arrangement of the basepairs within the duplex, without 
implication of continuity in the backbone portion of the nucleic acid strand. Duplex 
nucleic acids with uninterrupted basepairing, but with nicks in one or both strands are 
within the definition of a continuous duplex. 

The term "duplex" refers to the state of nucleic acids in which the base portions of 
the nucleotides on one strand are bound through hydrogen bonding the their 
complementary bases arrayed on a second strand. The condition of being in a duplex 
form reflects on the state of the bases of a nucleic acid. By virtue of base pairing, the 
strands of nucleic acid also generally assume the tertiary structure of a double helix, 
having a major and a minor groove. The assumption of the helical form is implicit in the 
act of becoming duplexed. 

The term "duplex dependent protein binding" refers to the binding of proteins to 
nucleic acid that is dependent on the nucleic acid being in a duplex, or helical form. 

The term "duplex dependent protein binding sites or regions" as used herein refers 
to discrete regions or sequences within a nucleic acid that are bound with particular 
affinity by specific duplex-dependent nucleic acid binding proteins. This is in contrast to 
the generalized duplex-dependent binding of proteins that are not site-specific, such as 
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the histone proteins that bind chromatin with little reference to specific sequences or 
sites. 

The term "protein-binding region" as used herein refers to a nucleic acid region 
identified by a sequence or structure as binding to a particular protein or class of proteins. 
5 It is within the scope of this definition to include those regions that contain sufficient 
genetic information to allow identifications of the region by comparison to known 
sequences, but which might not have the requisite structure for actual binding (e.g., a 
single strand of a duplex-depending nucleic acid binding protein site). As used herein 
"protein binding region" excludes restriction endonuclease binding regions. 

10 The term "complete double stranded protein binding region" as used herein refers 

to the minimum region of continuous duplex required to allow binding or other activity of 
a duplex-dependent protein. This definition is intended to encompass the observation 
that some duplex dependent nucleic acid binding proteins can interact with fiill activity 
with regions of duplex that may be shorter than a canonical protein binding region as 

1 5 observed in one or the other of the two single strands. In other words, one or more 
nucleotides in the region may be allowed to remain unpaired without suppressing 
binding. As used here in, the term "complete double stranded binding region" refers to 
the minimum sequence that will accommodate the bmding fiinction. Because some such 
regions can tolerate non-duplex sequences in multiple places, although not necessarily 

20 simultaneously, a single protein binding region might have several shorter sub-regions 
that, when duplexed, will be fully competent for protein binding. 

The term "template" refers to a strand of nucleic acid on which a complementary 
copy is built from nucleoside triphosphates through the activity of a template-dependent 
nucleic acid polymerase. Within a duplex the template strand is, by convention, depicted 

25 and described as the "bottom" strand. Similarly, the non-template strand is often depicted 
and described as the "top" strand. 

The term "template-dependent RNA polymerase" refers to a nucleic acid 
polymerase that creates new RNA strands through the copying of a template strand as 
described above and which does not synthesize RNA in the absence of a template. This 

30 is in contrast to the activity of the template-independent nucleic acid polymerases that 
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synthesize or extend nucleic acids without reference to a template, such as terminal 
deoxynucleotidyl transferase, or Poly A polymerase. 

The term "ARRESTOR molecule" refers to an agent added to or included in an 
invasive cleavage reaction in order to stop one or more reaction components from 
participating in a subsequent action or reaction. This may be done by sequestering or 
inactivating some reaction component (e.g., by binding or base-pairing a nucleic acid 
component, or by binding to a protein component). The term "ARRESTOR 
oligonucleotide" refere to an oligonucleotide included in an invasive cleavage reaction in 
order to stop or arrest one or more aspects of any reaction (e.g., the first reaction and/or 
any subsequent reactions or actions; it is not intended that the ARRESTOR 
oligonucleotide be limited to any particular reaction or reaction step). This may be done 
by sequestering some reaction component (e.g., base-pairing to another nucleic acid, or 
binding to a protein component). However, it is not intended that the tem be so limited 
as to just situations in which a reaction component is sequestered. 

As used herein, the term "kit" refers to any delivery system for delivering 
materials. In the context of reaction assays, such delivery systems include systems that 
allow for the storage, transport, or delivery of reaction reagents (e.g., oligonucleotides, 
enzymes, etc. in the appropriate containers) and/or supporting materials (e.g., buffers, 
written instructions for performing'the assay etc.) from one location to another. For 
example, kits include one or more enclosures (e.g., boxes) containing the relevant 
reaction reagents and/or supporting materials. As used herein, the term "fragmented kit" 
refers to a delivery systems comprising two or more separate containers that each contain 
a subportion of the total kit components. The containers may be delivered to the intended 
recipient together or separately. For example, a first container may contain an enzyme 
for use in an assay, while a second container contains oligonucleotides. The term 
"fragmented kit" is intended to encompass kits contaming Analyte specific reagents 
(ASR's) regulated under section 520(e) of the Federal Food, Drug, and Cosmetic Act, but 
are not limited thereto. Indeed, any delivery system comprising two or more separate 
containers that each contains a subportion of the total kit components are included in the 
twm "fragmented kit." In contrast, a "combined kit" refers to a delivery system 
containing all of the components of a reaction assay in a single container (e.g., in a single 
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box housing each of the desired components). The term "kit" includes both fragmented 
and combined kits. 

As used herein, the term "ilmctional domain" refers to a region, or a part of a 
• region, of a protein (e.g., an enzyme) that provides one or more functional characteristic 
5 of the protein. For example, a functional domain of an enzyme may provide, directly or 
indirectly, one or more activities of the enzyme including, but not limited to, substrate 
binding capability and catalytic activity. A functional domain may be characterized 
through mutation of one or more amino acids within the functional domain, wherein 
mutation of the amino acid(s) alters the associated fimctionality (as measured empirically 

10 in an assay) thereby indicating the presence of a functional domain. 

As used herein, the term "heterologous functional domain" refers to a protein 
functional domain that is not in its natural environment. For example, a heterologous 
functional domain includes a ftmctional domain from one enzyme introduced into another 
enzyme. A heterologous functional domain also includes a functional domain native to a 

15 protein that has been altered in some way (e.g., mutated, added in multiple copies, etc.). 
A heterologous functional domain may comprise a plurality of contiguous amino acids or 
may include two or more distal amino acids are amino acids fragments (e.g., two or more 
amino acids or fragments with intervening, non-heterologous, sequence). Heterologous 
functional domains are distinguished from endogenous functional domains in that the 

20 heterologous amino acid(s) are joined to or contain amino acid sequences that are not 

found naturally associated with the amino acid sequence in nature or are associated with a 
portion of a protein not found in nature. 

As used herein, the term "altered functionality in a nucleic acid cleavage assay" 
refers to a characteristic of an enzyme that has been altered in some manner to differ from 

25 its natural state (e.g., to differ from how it is found in nature). Alterations include, but 
are not limited to, addition of a heterologous functional domain (e.g., through mutation or 
through creation of chimerical proteins). In some embodiments, the altered characteristic 
of the enzyme may be one that improves the performance of an enzyme in a nucleic acid 
cleavage assay. Types of improvement include, but are not limited to, improved nuclease 

30 activity (e.g., improved rate of reaction), improved substrate binding (e.g., increased or 
decreased binding of certain nucleic acid species [e.g., RNA or DNA] that produces a 
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desired outcome [e,g., greater specificity, improved substrate turnover, etc.]), and 
improved background specificity (e.g.. less undesired product is produced). The present 
invention is not limited by the nucleic cleavage assay used to test improved functionality. 
However, in some preferred embodiments of the present invention, an invasive cleavage 
5 assay is used as the nucleic acid cleavage assay. In certain particularly preferred 

embodiments, an invasive cleavage assay utilizing an RNA target is used as the nucleic 

acid cleavage assay. 

As used herein, the terms "N-terminal" and "C-terminal" in reference to 
polypeptide sequences refer to regions of polypeptides including portions of the N- 

10 terminal and C-terminal regions of the polypeptide, respectively. A sequence that 
includes a portion of the N-terminal region of polypeptide includes amino acids 
predominantly from the N-terminal half of the polypeptide chain, but is not limited to 
such sequences. For example, an N-terminal sequence may include an interior portion of 
the polypeptide sequence including bases firam both the N-tenminal and C-terminal halves 

15 of the polypeptide. The same applies to C-terminal regions. N-terrainal and C-terminal 
regions may, but need not, mclude the amino acid defining the ulHraate N-terminal and 
C-terminal ends of the polypqitide, respectively. 

DESCRIPTION OF THE DRAWINGS 

20 Figure 1 shows a schematic representation of sequential invasive cleavage 

reactions. In step A, an upstream INVADER oligonucleotide and a downstream probe 
combine with a target nucleic acid strand to form a cleavage structure. In step B, the 
portion of the cleaved signal probe fi:om A combines with a second target nucleic acid 
strand and a labeled signal probe to form a second cleavage stmcture. In step C. cleavage 

25 of the labeled second cleavage structure yields a detectable signal 

Figure 2 shows schematic representations of several examples of invasive 
cleavage structures comprising RNA target strands (SEQ ID N0:141). Panel A depicts 
an INVADER oligonucleotide (SEQ ID NO:142) and probe (SEQ ID NO:143). Panel B 
depicts an INVADER oligonucleotide (SEQ ID NO:144) and probe (SEQ ID NO: 143). 

30 Panel C depicts an INVADER oligonucleotide (SEQ ID NO:145) and probe (SEQ ID 
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NO: 145). Panel D depicts an INVADER oligonucleotide (SEQ ID NO; 145) and probe 
(SEQIDNO:146). 

Figure 3 shows schematic representations of two examples of structures that are 
not invasive cleavage structures labelled SEQ ID NOs:147-152. 
5 Figure 4 shows a schematic representation of a configuration of invasive cleavage 

that is useful for detection of target sequence variations. In A, an invasive cleavage 
structure having overlap between the two probes is formed, and the arrow indicates that it 
is cleavable by the enzymes of the present invention. In B, variation of the target 
sequence removes a region of complementarity to the downstream probe and eliminates 
10 the overlap. The absence of an arrow in panel B indicates a reduced rate of cleavage of 
this structure compared to that diagrammed in panel A. 

Figure 5 shows a diagram of the X-ray structure of a ternary complex of Klentaql 
with primer/template DNA in the polymerizing mode determined by Li et al (Li et aiy 
Protein Sci., 7:1 1 16 [1998]). Without intending to represent precise borders between 
15 features of the physical form, the portions referred to in the text as the "fingers", "thumb" 
and "palm" regions are loosely indicated by the circle, rectangle, and oval, respectively. 

Figure 6 shows a schematic diagram of the DNA polymerase gene firom Thermus 
aquaticus. Restriction sites used in these studies are indicated above. The approximate 
regions encoding various structural or functional domains of the protein are indicated by 
20 double-headed arrows, below. 

Figure 7 shows a schematic diagram of the chimeric constructs comprising 
portions of the TaqPol gene and the TthPol gene. Open and shaded boxes denote TaqPol 
and TthPol sequences, respectively. The numbers correspond to the amino acid sequence 
of TaqPol The 5* nuclease and polymerase domains of TaqPol and the pahn, thumb, and 
25 fingers regions of the polymerase domain are indicated. The abbreviations for the 

restrictions sites used for recombination are as follows: B, EcoRI; N, NotI; Bs, BstBI; D, 
Ndel; B, BamHI; and S, Sail. 

Figure 8A-H shows a comparison of the nucleotide structure of the polymerase 
genes isolated from Thermus aquaticus (SEQ ID NO: 153), Thermus flavus (SEQ ID 
30 NO:154) and Thermus thermophilus (SEQ ID NO:155); the consensus sequence (SEQ ID 
NO: 156) is shown at the top of each row. 
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Figure 9A-C shows a comparison of the amino acid sequence of the polymerase 
isolated from Themus aquaticus (SEQ ID NO: 157). Themusflavus (SEQ ID NO: 158), 
and Themus themophilus (SEQ ID N0:1); the consensus sequence (SEQ ID NO:159) is 
shown at the top of each row, 

5 Figure 10 shows the sequences and proposed structures of substrates for the 

invasive signal amplification reaction with human IL-6 KNA target strand (SEQ ID 
NO:160) and upstream probe (SEQ ID N0:161). The cleavage site of the downstream 
probe (SEQ ID NO: 162) is indicated by an arrow. Sequence of the IL-6 DNA target 
strand (SEQ ID NO: 163) is shown below. 

10 Figure 1 1 shows the image generated by a fluorescence imager showing the 

products of invasive cleavage assays using the indicated enzymes, and the IL-6 substrate 
of Figure 10 having either a DNA target strand (A) or an RNA target strand (B). 

Figure 12 compares the cycling cleavage activities of Taq DN RX HT, Tth DN 
RX HT, and Taq-Tth chimerical enzymes with IL-6 substrate havmg an KNA target 

15 strand. 

Figure 13 shows a comparison of the amino acid sequences of the Bstl-BamHI 
fragments of TaqPol (SEQ ID NO:164) and TthPol (SEQ ID NO:165). Pairs of similar 
amino acids are shaded with Ught gray. Aligned amino acids that have a charge 
difference are shaded with dark gray. The numbers correspond to the amino acid 
20 sequence of TaqPol. Amino acids of TaqPol changed to the corresponding amino acids 
of TthPol by site-directed mutagenesis are indicated by (+). 

Figure 14 compares the cycling cleavage activities of Taq DN RX HT, Taq-Tth 
chimerical enzymes, and chimerical enzymes having the indicated additional amino acid 
modifications, with IL-6 substrate having an RNA target strand. 
25 Figure 1 5 compares the cycling cleavage activities of Taq DN RX HT, Tth DN 

RX HT, and Taq DN RX HT having flie indicated amino acid modifications, with IL-6 
substrate having an RNA tai^et strand. 

Figure 16 compares polymerization activities of TaqPol, TtiiPol, and Taq-Tth 
chimerical enzymes, and TaqPol having the indicated amino acid modifications. 
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Figure 17 shows a diagram of the X-ray structure of a ternary complex of 
Klentaql with primer/template DNA in the polymerizing mode determined by Li et al 
(Li et al. Protein Sci., 7:1116 [1998]). Amino acids G41 8 and E507 are indicated. 

Figures 1 8 A-D show schematic diagrams of examples of substrates that may be 
5 used to measure various cleavage activities of enzymes. The substrates may be labeled, 
for example, with a fluorescent dye and a quenching moiety for FRET detection, as 
shown, to facilitate detection and measurement. The substrates of 18A and 18B are 
invasive cleavage structures having RNA and DNA target strands, respectively. 18C 
shows an example of an X-structure, and 18D shows an example of a hairpin structure, 
10 both of which may be used to assess the activity of enzymes on altemative structures that 
may be present in invasive cleavage reactions. 

Figure 19 shows schematic diagrams of chimeric constructs comprising portions 
of the TaqPoI gene and the TthPol gene. Open and shaded boxes denote TaqPol and 
TthPol sequences, respectively. The chimeras also include the DN, RX, and HT 
15 modifications. A table compares the cleavage activity of each protein on the indicated 
cleavage substrates. 

Figure 20A shows a schematic diagram for an RNA containing invasive cleavage 
substrate. The 5* end of the target molecule (SEQ ID NO: 166) is modified with biotin 
and blocked with streptavidin as described. The downstream probe (SEQ ID NO: 167) 
20 with cleavage site is also shown. Panels B-D show analysis of the properties of the Taq 
DN RX HT G418KyE507Q mutant in cleavage of the shown substrate under conditions 
of varying reaction temperature, KCl concentration, and MgS04 concentration. 

Figure 21 shows schematic diagrams for model substrates used to test enzymes 
for invasive cleavage activity. The molecule shown in 2 1 A provides a DNA target strand 
25 (SEQ ID NO:168), while the model shown in 21B provides an RNA containing target 
strand (SEQ ID NO:167). Both 21 A and B show downstream probe SEQ ID NO:166, 
Figure 22 shows schematic diagrams for model substrates used to test enzymes 
for cleavage activity on altemative, non-invasive stmctures. 

Figure 23 shows a schematic diagram for a model substrate used to test enzymes 
30 for invasive cleavage activity. 
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Figure 24 shows schematic diagrams for a model substrate used to test enzymes 
for invasive cleavage activity on RNA or DNA target strands. 

Figure 25 compares the cycling cleavage activities of Tth DN RX HT, Taq 2M, 
TfiPol, Tsc Pol, and Tfi and Tsc-derived mutant enzymes. 
5 Figure 26 depicts structures that may be employed to determine the ablity of an 

enzyme to cleave a probe in the presence and the absence of an upstream oligonucleotide. 
Fig. 26 displays the sequence of oligonucleotide 89-15-1 (SEQ ID NO:212), 
oligonucleotide 81-69-5 (SEQ ID N0:213). oligonucleotide 81-69-4 (SEQ ID NO:214), 
oligonucleotide 81-69-3 (SEQ ID NO:215), oligonucleotide 81-69-2 (SEQ ID N0:216) 
10 and a portion of M13mpl8 (SEQ ID NO:217). 

Figure 27 shows the image generated by a fluoresceaice imager that shows the 
dependence of Pfu FEN-1 on the presence of an overlapping upstream oligonucleotide for 
specific cleavage of the probe. 

Figure 28a shows the image generated by a fluorescence imager that compares the 
15 amount of product generated in a standard (t.e., a non-sequential invasive cleavage 
reaction) and a sequ«itial invasive cleavage reaction. 

Figure 28b is a graph comparing the amount of product generated in a standard or 
basic (i.e., a non-sequential invasive cleavage reaction) and a sequential invasive 
cleavage reaction ("INVADER sqnl") (y axis = fluorescence units; x axix = attomoles of 
20 target). 

Figure 29 shows the image generated by a fluorescence imager that shows that the 
products of a completed sequential invasive cleavage reaction cannot cross contaminant a 
subsequent similar reaction. 

Figure 30 shows the sequence of the oligonucleotide anployed in an invasive 
25 cleavage reaction for the detection of HCMV viral DNA; Fig. 30 shows tiie sequence of 
oligonucleotide 89-76 (SEQ ID NO:218), oligonucleotide 89-44 (SEQ ID NO:219) and 
nucleotides 3057-3110 of tfie HCMV genome (SEQ ID NO:220). 

Figure 3 1 shows the image generated by a fluorescence imago: that shows the 
sensitive detection of HCMV viral DNA in samples containing human genomic DNA 
30 using an invasive cleavage reaction. 
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Figure 32 is a schematic that illustrates one embodiment of the present invention, 
where the cut probe from an initial invasive cleavage reaction is employed as the 
INVADERoligonucleotide in a second invasive cleavage reaction, and where an 
ARRESTOR oligonucleotide prevents participation of remaining uncut first probe in the 
5 cleavage of the second probe. 

Figure 33 is a schematic that illustrates one embodiment of the present invention, 
where the cut probe from an initial invasive cleavage reaction is employed as an 
integrated INVADER-target complex in a second invasive cleavage reaction, and where 
an ARRESTOR oligonucleotide prevents participation of remaining uncut first probe in 
1 0 the cleavage of the second probe. 

Figure 34 shows three images generated by a fluorescence imager showing that 
two different lengths of 2* 0-methyl, 3' terminal amine-modified ARRESTOR 
oligonucleotide both reduce non-specific background cleavage of the secondary probe 
when included in the second step of a reaction where the cut probe from an initial 
1 5 invasive cleavage reaction is employed as an integrated INVADER-target complex in a 
second invasive cleavage reaction. 

Figure 35A shows two images generated by a fluorescence imager showing the 
effects on nonspecific and specific cleavage signal of increasing concentrations of 
primary probe in the first step of a reaction where the cut probe from an initial invasive 
20 cleavage reaction is employed as the INVADER oligonucleotide in a second invasive 
cleavage reaction. 

Figure 35B shows two images generated by a fluorescence imager showing the 
effects on nonspecific and specific cleavage signal of increasing concentrations of 
primary probe in the first step of a reaction, and inclusion of a 2* 0-methyl, 3' terminal 

25 amine-modified ARRESTOR oligonucleotide in the second step of a reaction where the 
cut probe from an initial invasive cleavage reaction is employed as the INVADER 
oligonucleotide in a second invasive cleavage reaction. 

Figure 35C shows shows a graph generated using the spreadsheet MICROSOFT 
EXCEL software, comparing the effects on nonspecific and specific cleavage signal of 

30 increasing concentrations of primary probe in the first step of a reaction, in the presence 
or absence of a 2* 0-methyl, 3* terminal amine-modified ARRESTOR oligonucleotide in 
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the second step of a reaction where the cut probe from an initial invasive cleavage 
reaction is employed as the INVADER oligonucleotide in a second invasive cleavage 
reaction. 

Figure 36A shows two images generated by a fluorescence imager showing the 
5 effects on nonspecific and specific cleavage signal of including an unmodified 

ARRESTOR oligonucleotide in the second step of a reaction where the cut probe from an 
initial invasive cleavage reaction is employed as the INVADER oligonucleotide in a 
second invasive cleavage reaction. 

Figure 36B shows two images generated by a fluorescence imager showing the 
10 effects on nonspecific and specific cleavage signal of including a 3' terminal amine 
modified ARRESTOR oligonucleotide, a partially T 0-methyl substituted, 3' teraiinal 
amine modified ARRESTOR oligonucleotide, or an entirely 2' 0-raethyl, 3" terminal 
amine modified ARRESTOR oligonucleotide in the second step of a reaction where the 
cut probe from an initial invasive cleavage reaction is employed as the INVADER 
15 oligonucleotide in a second invasive cleavage reaction. 

Figure 37A shows two images generated by a fluorescence imager comparing the 
effects on nonspecific and specific cleavage signal of including ARRESTOR 
oligonucleotides of different lengtiis in the second step of a reaction where the cut probe 
from an initial invasive cleavage reaction is employed as the INVADER oligonucleotide 
20 in a second invasive cleavage reaction. 

Figure 37B shows two images generated by a fluorescence imager comparing tiie 
effects on nonspecific and specific cleavage signal of including an arrestoer 
oligonucleotides of different lengths in the second step of a reaction where the cut probe 
from an initial invasive cleavage reaction is employed as the INVADER oligonucleotide 
25 in a second invasive cleavage reaction, and in which a longw variant of the secondary 
probe used in the reactions in Fig. 37A is tested. 

Figure 37C shows a schematic diagram of a primary probe aligned with several 
ARRESTOR oligonucleotides of different lengtiis. The region of the primary probe that 
is complementary to the HBV target sequence is underlined. The ARRESTOR 
30 oligonucleotides are aligned with the probe by complementarity. 
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Figure 38 shows two images generated by a fluorescence imager comparing the 
effects on nonspecific and specific cleavage signal of including ARRESTOR 
oligonucleotides of different lengths in the second step of a reaction where the cut probe 
from an initial invasive cleavage reaction is employed as the INVADER oligonucleotide 
5 in a second invasive cleavage reaction, using secondary probes of two different lengths. 

Figure 39 shows a graph of the calculated running average of a ten nucleotide 
stiretch of the hUbiquitin RNA (the Ave(lO) Index) derived firom the SS-count output of • 
an mfold analysis, expressed as a percentage of the total number of structures found by 
mfold that include a particular base, plotted against the position of the base. 
10 Figure 40 shows an example microplate layout for an RNA INVADER assay 

comprising 40 samples, 6 standards, and a No Target Control. 

Figure 41 shows INVADER assay components for use in detecting human (h), 
mouse (m), or rat (r) RNAs of the indicated genes or transcripts. 

Figure 42 shows a computer display of an INVADERCREATOR Order Entry 

IS screen. 

Figure 43 shows a computer display of an INVADERCREATOR Multiple SNP 
Design Selection screen. 

Figure 44 shows a computer display of an INVADERCREATOR Designer 
Worksheet screeiL 

20 Figure 45 shows a computer display of an INVADERCREATOR Output Page 

screen. 

Figure 46 shows a computer display of an INVADERCREATOR Printer Ready 
Output screen. 

Figure 47 shows INVADER assay components (SEQ ID NOs:709-2640) for use 
25 in detecting RNA target nucleic acids. Components are grouped per RNA analyte to be 
detected. Where multiple probes, INVADER oligonucleotides, stacker oligonucleotides, 
ARRESTOR oligonucleotides, or other components are provided, any of the multiple 
componaits may be used, unless indicated otherwise. Unless indicated otherwise, 
oligonucleotides are presented S'-3' orientation. 
30 Figure 48 shows a chart showing the Ave(lO) Index against base pair position. 
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DESCRIPTION OF THE INVENTION 



Introduction 

The present invention relates to methods and compositions for treating nucleic 
5 acid, and in particular, methods and compositions for detection and characterization of 
nucleic acid sequences and sequence changes. 

In preferred embodiments, the present invention relates to means for cleaving a 
nucleic acid cleavage structure in a site-specific manner. While the present invention 
provides a variety of cleavage agents, in some embodiments, the present invention relates 
10 to a cleaving enzyme having 5' nuclease activity without interfering nucleic acid 
synthetic ability. In other embodiments, the present invention provides novel 
polymerases (eg., thermostable polymerases) possessing altered polymerase and/or 
nucleases activities. 

For example, in some embodiments, the present invention provides 5* nucleases 
15 derived from thermostable DNA polymerases that exhibit altered DNA synthetic activity 
from that of native thermostable DNA polymerases. The 5' nuclease activity of the 
polymerase is retained while the synthetic activity is reduced or absent. Such 5' 
nucleases are capable of catalyzing the structure-specific cleavage of nucleic acids in the 
absence of interfering synthetic activity. The lack of synthetic activity during a cleavage 
20 reaction results in nucleic acid cleavage products of uniform size. 

The novel properties of the nucleases of the invention form the basis of a method 
of detecting specific nucleic acid sequences. This method reUes upon the amplification 
of the detection molecule rather than upon the ampUfication of the target sequence itself 
as do existing methods of detecting specific target sequences. 
25 DNA polymerases pNAPs), such as those isolated &om E. coli or fi»m 

thermophilic bacteria of the genus Thermus as well as other organisms, are enzymes that 
synthesize new DNA strands. Several of the known DNAPs contain associated nuclease 
activities in addition to the synthetic activity of the enzyme. 

Some DNAPs are known to remove nucleotides bora, the 5' and 3' ends of DNA 
30 chains (Komberg, DNA Replication, W.H. Freeman and Co.. San Francisco, pp. 127-139 
[1980]). These nuclease activities are usually referred to as 5' exonuclease and 3' 
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exonuclease activities, respectively. For example, the 5' exonuclease activity located in 
tiie N-temiinal domain of several DNAPs participates in the removal of RNA primers 
during lagging strand synthesis during DNA replication and the removal of damaged 
nucleotides during repair. Some DNAPs, such as the E, coli DNA polymerase 
5 (DNAPEcl), also have a 3' exonuclease activity responsible for proof-reading during 
DNA synthesis (Komberg, supra). 

A DNAP isolated from Thermus aqmticus, termed Taq DNA polymerase 
(DNAPTaq), has a 5' exonuclease activity, but lacks a functional 3' exonucleolytic 
domain (Tindall and Kunkell, Biochem., 27:6008 [1988]). Derivatives of DNAPEcl and 

10 DNAPTaq, respectively called the Klenow and Stoffel fragments, lack 5' exonuclease 
domains as a result of enzymatic or genetic manipulations (Brutlag et al., Biochem. 
Biophys. Res. Commun., 37:982 [1969]; Erlich et al. Science 252:1643 [1991]; Setlow 
and Komberg, J. Biol. Chem., 247:232 [1972]), 

The 5* exonuclease activity of DNAPTaq was reported to require concurrent 

1 5 synthesis (Gelfand, PGR Technology - Principles and Applications for DNA 

Amplification, H.A. Eriich, [Ed.], Stock^ton Press, New York, p. 19 [1989]). Although 
mononucleotides predominate among the digestion products of the 5' exonucleases of 
DNAPTaq and DNAPEcl. short oligonucleotides 12 nucleotides) can also be observed 
implying that these so-called 5* exonucleases can function endonucleolytically (Setlow, 

20 • supra; Holland et al., Proc. Natl. Acad. Sci. USA 88:7276 [1991]). 

In WO 92/06200, Gelfand et al show that the preferred substrate of the 5' 
exonuclease activity of the thermostable DNA polymerases is displaced single-stranded 
DNA. Hydrolysis of the phosphodiester bond occurs between the displaced single- 
stranded DNA and the double-helical DNA with the preferred exonuclease cleavage site 

25 being a phosphodiester bond in the double helical region. Thus, the 5' exonuclease 
activity usually associated with DNAPs is a structure-dependent single-stranded 
endonuclease and is more properly referred to as a 5' nuclease. Exonucleases ^re 
enzymes that cleave nucleotide molecules from the ends of the nucleic acid molecule. 
Endonucleases, on the other hand, are enzymes that cleave the nucleic acid molecule at 

30 internal raflier than terminal sites. The nuclease activity associated with some 

thermostable DNA polymerases cleaves endonucleolytically but this cleavage requires 
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contact with the 5' end of the molecule being cleaved. Therefore, these nucleases are 
referred to as 5' nucleases. 

Whffli a 5' nuclease activity is associated with a eubacterial Type A DNA 
polymerase, it is found in the one third N-terminal region of the protein as an independent 
functional domain. The C-terminal two-thirds of the molecule constitute the 
polymerization domain that is responsible for the synthesis of DNA. Some Type A DNA 
polymerases also have a 3' exonuclease activity associated with the two-third C-terminal 

region of the molecule. 

The 5' exonuclease activity and the polymerization activity of DNAPs can be 
separated by proteolytic cleavage or genetic manipulation of the polymerase molecule. 
The Klenow or large proteolytic cleavage fiagment of DNAPEcl contains the 
polymerase and 3' exonuclease activity but lacks the 5' nuclease activity. The Stoffel 
fragment of DNAPTaq (DNAPStQ lacks the 5' nuclease activity due to a genetic 
manipulation that deleted the N-terminal 289 amino acids of the polymerase molecule 
(Erlich et al.. Science 252:1643 [1991]). WO 92/06200 describes a thermostable DNAP 
with an altered level of 5' to 3' exonuclease. U.S. Patent No. 5.108.892 describes a 
Thermus aquaticus DNAP without a 5' to 3' exonuclease. Thermostable DNA 
polymerases with lessened amounts of synthetic activity are available (Third Wave 
Technologies, Madison. WI) and are described in U.S. Pat. Nos. 5,541.31 1. 5.614.402, 
5,795,763, 5,691,142, and 5,837,450. herein incorporated by reference in their entireties. 
The present invention provides 5' nucleases derived from thermostable Type A DNA 

polymerases that retain 5' nuclease activity but have reduced or absent synthetic activity. 

Tte ability to uncouple the synttietic activity of the enzyme from the 5' nuclease activity 

proves that the 5' nuclease activity does not require concurrent DNA synthesis as was 

previously reported (Gelftnd, PGR Technology, supra). 

In addition to the 5'-exonuclease domains of tiie DNA polymerase I proteins of 

Eubacteria, described above. 5' nucleases have been found associated with bacteriophage. 

eukaryotes and arehaebacteria. Overall, all of the enzymes in this family display very 

similar substrate specificities, despite their Hmited level of sequence similarity. 

Consequently, enzymes suitable for use in the methods of the present invention may be 

isolated or derived from a wide array of sources. 
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A mammalian enzyme with functional similarity to the 5'-exonuclease domain of 
E. coli Pol I was isolated nearly 30 years ago (Lindahl, et al, Proc Natl Acad Sci U S A 
62(2): 597-603 [1969]). Later, additional members of this group of enzymes called flap 
endonucleases (FENl) from Eukarya and Archaea were shown to possess a nearly 

5 identical structure specific activity (Harrington and Lieber. Embo J 13(5), 1235-46 
[1994]; Murante et al, J Biol Chem 269(2), 1191-6 [1994]; Robins, et al, J Biol Chem 
269(46), 28535-8 [1994]; Hosfield, et al, J Biol Chem 273(42), 27154-61 [1998]), 
despite limited sequence similarity. The substrate specificities of the FENl enzymes, and 
the eubacterial and related bacteriophage enzymes have been examined and found to be 

10 similar for all enzymes (Lyamichev, et a/., Science 260(5109), 778-83 [1993], Hanington 
and Lieber, supra^ Murante, et ai, supra, Hosfield, et at, supra^ Rao, et ai^ J Bacterid 
180(20), 5406-12 [1998], Bhagwat, etal,, J. Biol Chem 272(45), 28523-30 [1997], 
Garforth and Sayers, Nucleic Acids Res 25(19), 3801-7 [1997]). 

Using preformed substrates, many of the studies cited above determined that these 

1 5 nucleases leave a gap upon cleavage, leading the authors to speculate that DNA 

polymerase must then act to fill in that gap to generate a ligatable nick. A number of 
other 5' nucleases have been shown to leave a gap or overlap after cleavage of the same 
or similar flap substrates. It has since been determined that that all the structure-specific 
5'-exonucleases leave a nick after cleavage if the substrate has an overlap between the 

20 upstream and downstream duplexes (Kaiser et a/., J. Biol .Chem. 274(30):21387-21394 
[1999]). While duplexes having several bases of overlapping sequence can assume 
several different conformations through branch migration, it was determined that 
cleavage occurs in the conformation wh^e the last nucleotide at the 3' end of the 
upstream strand is unpaired, with the cleavage rate being essentially the same whether the 

25 end of the upstream primer is A, C, G, or T. It was determined to be positional overlap 
between the 3* end of the upstream primer and downstream duplex, rather then sequence 
overlap, that provides optimal cleavage. In addition to allowing these enzymes to leave a 
nick after cleavage, the single base of overlap causes the enzymes to cleave several 
orders of magnitude faster than when a substrate lacks overlap (Kaiser et al, supra), 

30 Any of the 5' nucleases described below may find application in one or more 

embodiments of the methods described herein. 5' nucleases of particular utility in the 
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methods Of present invention include but are not limited to polymerases from a Themus 
species including, but not limited to, Themus aquaticus, nermusjlavus, Themus 
themophUus, Themus filifomus, and Thernm scotoductus. and altered polymerases. 
Particularly preferred are altered polymerases exhibiting improved performance in 
detection assays based on the cleavage of a DNA member of an invasive cleavage 
structure tiiat comprises an RNA target strand. 

Chimerical polymerases may find application in one or more embodiments of the 
present invention, including but not limited to chimerical polymerases comprising one or 
more portions of one or more FEN nucleases including but are not limited to those of 
Methanococcus Jannaschii, Methmobacterium themoautotrophicum. Archaeoglobus 
veneficus. Sulfohbus solfataricus, Pyrobaculum aerophilum. Themococcus lUordis. 
Archaeaglobusprojundus. Acidiams brierfyi. Acidianus ambivalens, Desuljurococcus 
amylofyticus. Desulfurococcus mobUis. Pyrodictium brocldi, Themococcus gorgonarius. 
Themococcus zilligii. Methanopyrus kandleri. Methanococcus igneus. Pyrococcus 
horikoshii, and Aeropynm pernix; particulariy preferred FENl enzymes are chimerical 
Archaeoglobus fidgidus and Pyrococcus fiiriosus. Particularly prefen^d are altered 
polymerases exhibiting improved performance in detection assays based on the cleavage 
of a DNA member of an invasive cleavage structure that comprises an RNA target strand. 

The detailed description of the invention is presented in the following sections: 

I. Detection of Specific Nucleic Acid Sequences Using 5' Nucleases in an 

INVADER Directed Cleavage Assay, 
n. Signal Enhancement By Incorporating The Products Of An Invasive Cleavage 

Reaction Into A Subsequent Invasive Cleavage Reaction; 
m. Effect of ARRESTOR Oligonucleotides on Signal and Background in Sequential 
Invasive Cleavage Reactions. 

IV. Improved Enzymes For Use In INVADER Oligonucleotide-Durected Qeavage 
Reactions Comprising RNA Targets; 

V. Reaction Design for INVADER Assay Detection of RNA Targets; 

VI. Kits for performing the RNA INVADER Assay; and 
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Vll. The INVADER Assay for Direct Detection and Measurement of Specific RNA 
Analytes. 

5 L Detection of Specific Nucleic Acid Sequences Using 5* Nucleases in an 
INVADER Directed Cleavage Assay 

1 . INVADER Assay Reaction design 

The present invention provides means for forming a nucleic acid cleavage 
1 0 structure that is dependent upon the presence of a target nucleic acid and cleaving the 
nucleic acid cleavage structure so as to release distinctive cleavage products. 5* nuclease 
activity, for example, is used to cleave the target-dependent cleavage structure and the 
resulting cleavage products are indicative of the presence of specific target nucleic acid 
sequences in the sample. When two strands of nucleic acid, or oligonucleotides, both 
1 5 hybridize to a target nucleic acid strand such that they form an overlapping invasive 
cleavage structure, as described below, invasive cleavage can occur. Through the 
interaction of a cleavage agent (e.g., a 5' nuclease) and the upstream oligonucleotide, the 
cleavage agent can be made to cleave the downstream oligonucleotide at an internal site 
in such a way that a distinctive fi:xigment is produced. Such embodiments have been 
20 termed the INVADER assay (Third Wave Technologies) and are described in U.S. Patent 
Appl. Nos. 5,846,717, 5,985,557, 5,994.069, 6,001,567, and 6,090,543 and PCT 
Publications WO 97/27214 and WO 98/42873, herein incorporated by reference in their 
entireties. 

The present invention further provides assays in which the target nucleic acid is 
25 reused or recycled during multiple rounds of hybridization with oligonucleotide probes 
and cleavage of the probes without the need to use temperature cycling (i.e., for periodic 
denaturation of target nucleic acid strands) or nucleic acid synthesis (i.e., for the 
polymerization-based displacement of target or probe nucleic acid strands). When a 
cleavage reaction is run under conditions in which the probes are continuously replaced 
30 on the target strand (e.g. through probe-probe displacement or through an equilibrium 
between probe/target association and disassociation, or through a combination 
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comprising these mechanisms. [ Reynaldo et al.. J. Mol. Biol. 97: 51 1 (2000)]) multiple 
probes can hybridize to the same target, allowing multiple cleavages, and the generation 

of multiple cleavage products. 

By the extent of its complementarity to a target nucleic acid strand, an 
5 oligonucleotide may be said to define a specific region of the target. In an invasive 
cleavage structure, the two oligonucleotides define and hybridize to regions of the target 
that are adjacent to one another {i.e.. regions without any additional region of the target 
between them). Either or both oligonucleotides may comprise additional portions that are 
not complementary to the target strand. In addition to hybridizing adjacently, in order to 
10 form an invasive cleavage structure, the 3' end of the upstream oligonucleotide must 
comprise an additional moiety. Whenboth oligonucleotides are hybridized to a target 
strand to form a structure and such a 3' moiety is present on the upstream oligonucleotide 
within the structure, the oligonucleotides may be said to overlap, and the structure may be 
described as an overiapping, or invasive cleavage structure. 
15 In one embodiment, the 3' moiety of the invasive cleavage structure is a single 

nucleotide. In this embodiment the 3' moiety may be any nucleotide {i.e., it may be, but 
it need not be complementary to the target strand). In a preferred embodiment the 3' 
moiety is a single nucleotide that is not complementary to the target strand. In another 
embodiment, the 3' moiety is anucleotide-like compound {Le., a moiety having chemical 
20 features similar to a nucleotide, such as a nucleotide analog or an organic ring compound; 
See e.g., U.S. Pat. No. 5,985,557). In yet another embodiment the 3' moiety is one or 
more nucleotides that duplicate in sequence one or more nucleotides present at the 5' end 
of the hybridized region of the downstream oligonucleotide. In a further embodiment, 
the duplicated sequence of nucleotides of the 3' moiety is followed by a single nucleotide 
25 that is not further dupUcative of the downsti^am oUgonucleotide sequence, and that may 
be any other nucleotide. In yet another embodiment, the duplicated sequence of 
nucleotides of the 3' moiety is followed by a nucleotide-like compound, as described 
above. 

The downstream oligonucleotide may have, but need not have, additional moieties 
30 attached to eitiier end of the region that hybridizes to the target nucleic acid strand. In a 
preferred embodiment, the downstream oligonucleotide comprises a moiety at its 5' end 
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a 5' moiety). In a particularly preferred embodiment, said 5' moiety is a 5' flap or 
arm comprising a sequence of nucleotides that is not complementary to the target nucleic 
acid strand. 

When an overlapping cleavage structure is formed, it can be recognized and 
5 cleaved by a nuclease that is specific for this structure (/.e., a nuclease that will cleave 
one or more of the nucleic acids in the overlapping structure based on recognition of this 
structure, rather than on recognition of a nucleotide sequence of any of the nucleic acids, 
forming the structure). Such a nuclease may be termed a "structure-specific nuclease". 
In some embodiments, the structure-specific nuclease is a S* nuclease. In a preferred 

10 embodiment, the structure-specific nuclease is the 5' nuclease of a DNA polymCTase. In 
another preferred embodiment, the DNA polymerase having the 5' nuclease is synthesis- 
deficient. In another preferred embodiment, the 5* nuclease is a FEN-1 endonuclease. In 
a particularly preferred embodiment, the 5* nuclease is thermostable. 

In some embodiments, said structure-specific nuclease preferentially cleaves the 

15 downstream oligonucleotide. In a preferred embodiment, the downstream 

oligonucleotide is cleaved one nucleotide into the 5* end of the region that is hybridized 
to the target within the overlapping structure. Cleavage of the overlapping structure at 
any location by a structure-specific nuclease produces one or more released portions or 
firagments of nucleic acid, termed "cleavage products." 

20 In some embodiments, cleavage of an overlapping structure is performed under 

conditions wherein one or more of the nucleic acids in the structure can disassociate (i.e. 
un-hybridize, or melt) from the structure. In one embodiment, full or partial 
disassociation of a first cleavage structure allows the target nucleic acid to participate in 
the formation of one or more additional overlapping cleavage structures. In a preferred 

25 embodiment, the first cleavage structure is partially disassociated. In a particularly 

preferred embodiment only the oligonucreotide that is cleaved disassociates fi:om the first 
cleavage structure, such that it may be replaced by another copy of the same . 
oligonucleotide. In some embodiments, said disassociation is induced by an increase in 
temperature, such that one or more oligonucleotides can no longer hybridize to the target 

30 strand. In other embodiments, said disassociation occurs because cleavage of an 

oligonucleotide produces only cleavage products that cannot bind to the target strand 
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under the conditions of the reaction. In a preferred embodiment, conditions are selected 
wherein an oligonucleotide may associate with (U.. hybridize to) and disassociate from a 
target strand regardless of cleavage, and wherein the oligonucleotide may be cleaved 
when it is hybridized to the target as part of an overlapping cleavage structure. In a 
particularly preferred embodiment, conditions are selected such that the number of copies 
of the oligonucleotide that can be cleaved when part of an overlapping structure exceeds 
the number of copies of the target nucleic acid strand by a sufficient amount that when 
the first cleavage stnicture disassociates, the probability that the target strand will 
associate with an intact copy of the oligonucleotide is greater than the probability that 
that it will associate wifli a cleaved copy of the oligonucleotide. 

In some embodiments, cleavage is performed by a structure-specific nuclease that 
can recognize and cleave structures that do not have an overlap. In a preferred 
embodiment, cleavage is performed by a structure-specific nuclease having a lower rate 
of cleavage of nucleic acid structures that do not comprise an overlap, compared to the 
rate of cleavage of structures comprising an overlap. In a particularly preferred 
embodiment, cleavage is performed by a structure-specific nuclease having less than 1% 
of the rate of cleavage of nucleic acid structures that do not comprise an overiap, 
compared to the rate of cleavage of structures comprising an overiap. 

In some embodiments it is desirable to detect the cleavage of the overlapping 
cleavage structure. Detection may be by analysis of cleavage products or by analysis of 
one or more of the remaining uncleaved nucleic acids. For convenience, the following 
discussion will refer to the analysis of cleavage products, but it will be appreciated by 
those skilled in the art that these methods may as easily be appUed to analysis of the 
uncleaved nucleic acids in an invasive cleavage reaction. Any method known in the art 
for analysis of nucleic acids, nucleic acid fiagments or oligonucleotides may be applied 
to the detection of cleavage products. 

In one embodiment, the cleavage products may be identified by chemical content, 
e.g., the relative amounts of each atom, each particular type of reactive group or each 
nucleotide base (Chargaff et al.. J. Biol. Chem. 177: 405 [1949]) they contain. In this 
way, a cleavage product may be distinguished firom a longer nucleic acid firom which it 
was released by cleavage, or fi-om other nucleic acids. 
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In another embodiment, the cleavage products may be distinguished by a 
particular physical attribute, including but not limited to length, mass, charge, or charge- 
to-mass ratio. In yet another embodiment, the cleavage product may be distinguished by 
a behavior that is related to a physical attribute, including but not limited to rate of 
5 rotation in solution, rate of migration during electrophoresis, coefficient of sedimentation 
in centrifugation, time of flight in MALDI-TOF mass spectrometry, migration rate or 
other behavior in chromatography, melting temperature ftom a complementary nucleic 
acid, orprecipitability from solution. 

Detection of the cleavage products may be through release of a label. Such labels 

1 0 may include, but are not limited to one or more of any of dyes, radiolabels such as ^^P or 
^^S, binding moieties such as biotin, mass tags, such as metal ions or chemical groups, 
charge tags, such as polyamines or charged dyes, hsq)tens such as digoxgenin, 
luminogenic, phosphorescent or fluorogenic moieties, and fluorescent dyes, either alone 
or in combination with moieties that can suppress or shift emission spectra, such as by 

1 5 fluorescence resonance energy transfer (FRET) or collisional fluorescence energy 
transfer. 

In some embodiments, analysis of cleavage products may include physical 
resolution or separation, for example by electrophoresis, hybridization or by selective 
binding to a support, or by mass spectrometry methods such as MALDI-TOF. In other 

20 embodiments, the analysis may be performed without any physical resolution or 

separation, such as by detection of cleavage-induced changes in fluorescence as in FRET- 
based analysis, or by cleavage-induced changes in the rotation rate of a nucleic acid in 
solution as in fluorescence polarization analysis. 

Cleavage products can be used subsequently in any reaction or read-out method 

25 that can make use of oligonucleotides. Such reactions include, but are not limited to, 
modification reactions, such as ligation, tailing with a template-independent nucleic acid 
polymerase and primer extension with a template-dependent nucleic acid polymerase. 
The modification of the cleavage products may be for purposes including, but not limited 
to, addition of one or more labels or binding moieties, alteration of mass, addition of 

30 specific sequences, or for any other purpose that would facilitate analysis of either the 
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cleavage products or analysis of any other by-product, result or consequence of the 
cleavage reaction. 

Analysis of the cleavage products may involve subsequent steps or reactions that 
do not modify the cleavage products themselves. For example, cleavage products may be 
used to complete a functional structure, such as a competent promoter for in vitro 
transcription or another protein binding site. Analysis may include the step of using the 
completed structure for or to perform its fimction. One or more cleavage products may 
also be used to complete an overlapping cleavage structure, thereby enabling a 
subsequent cleavage reaction, the products of which may be detected or used by any of 
the methods described herein, including the participation in further cleavage reactions. 

Certain preferred embodiments of the invasive cleavage reactions are provided in 
the following descriptions. In some embodiments, the methods of the present invention 
employ at least a pair of oligonucleotides that interact with a target nucleic acid to form a 
cleavage structure for a structure-specific nuclease. In some embodiments, the cleavage 
structure comprises i) a target nucleic acid that may be either single-stranded or double- 
stranded (when a double^stranded target nucleic acid is employed, it may be rendered 
single stranded, e.g.. by heating); u) a first oligonucleotide, termed the "probe," that 
defines a first region of the target nucleic acid sequence by beipg the complement of that 
t^on; ui) a second oligonucleotide, temied the "INVADER oUgonucleotide," the 5' part 
of which defines a second region of the same target nucleic acid sequence, adjacent to 
and downstream of the first target region, and the second part of which overlaps into the 
region defined by tiie first oligonucleotide. 

It can be considered that the binding of these oligonucleotides in this embodiment 
divides the target nucleic acid into three distinct regions: one region that has 
complementarity to only the probe; one region that has complementarity only to the 
INVADER oligonucleotide; and one region that has complementarity to both 
oligonucleotides. As discussed above, in some preferred embodiments of the present 
invention, the overlap may comprise moieties other than overlapping complementary 
bases. Thus, in some embodiments, there is a physical, but not sequence, overlap 
between the INVADER and probe oligonucleotides, i.e., in these latter embodiments. 
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there is not a region of the target nucleic acid that has complementarity to both 
oligonucleotides. 

a) Oligonucleotide design 

Design of these oligonucleotides (i.e., the INVADER oligonucleotide and the 
5 probe) is accomplished using practices that are standard in the art. For example, 

sequences that have self-complementarity, such that the resulting oligonucleotides would 
either fold upon themselves, or hybridize to each other at the expense of binding to the 
target nucleic acid, are generally avoided. 

One consideration in choosing a length for these oligonucleotides is the 

10 complexity of the sample containing the target nucleic acid. For example, the human 
genome is approximately 3x10^ basepairs in length. Any 10-nucleotide sequence will 
appear with a frequency of 1:4^°, or 1 : 1,048,576 in a random string of nucleotides, which 
would be approximately 2,861 times in 3 billion basepairs. Clearly, an oligonucleotide of 
this length would have a poor chance of binding uniquely to a 10-nucIeotide region 

1 5 within a target having a sequence the size of the human genome. If the target sequence 
were within a 3 kb plasmid, however, such an oligonucleotide might have a very 
reasonable chance of binding uniquely. By this same calculation it can be seen that an 
oligonucleotide of 16 nucleotides (i.e., a 16-mer) is the minimum length of a sequence 
that is mathematically likely to appear once in 3 x 10^ basepairs. This level of specificity 

20 may also be provided by two or more shorter oligonucleotides if they are configured to 
bind in a cooperative fashion (i.e., such that they can produce the intended complex only 
if both or all are bound to their intended target sequences), wherein the combination of 
the short oligonucleotides provides the desired specificity. In one such embodiment, the 
cooperativity between the shorter oligonucleotides is by a coaxial stacking effect that can 

25 occur when the oligonucleotides hybridize to adjacent sites on a target nucleic acid. In 
another embodiment, the shorter oligonucleotides are connected to one another, either 
directly, or by one or more spacer regions. The short oligonucleotides thus connected 
may bind to distal regions of the target and may be used to bridge across regions of 
secondary structure in a target Examples of such bridging oligonucleotides are described 

30 in PCT Publication WO 98/50403, herein incorporated by reference in its entirety. 
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A second consideration in choosing oligonucleotide length is the temperature 
range in which the oligonucleotides will be expected to function. A 16-mer of average 
base content (50% G-C bases) will have a calculated T„ of about 41°C, depending on. 
among other things, the concentration of the oligonucleotide and its target, the salt 
5 content of the reaction and the precise order of the nucleotides. As a practical matter, 
longer oligonucleotides are usually chosen to enhance the specificity of hybridization. 
OUgonucleotides 20 to 25 nucleotides in length are often used, as they are highly likely to 
be specific if used in reactions conducted at temperatures that are near their Tn,s (within 
about 5 »C of the T„). In addition, with calculated T^s in the range of 50 to 70°C, such 
10 oligonucleotides (i.e., 20 to 25-mers) are appropriately used in reactions catalyzed by 
thermostable enzymes, which often display optimal activity near this temperature range. 

The maximum length of the oligonucleotide chosen is also based on the desired 
specificity. One should avoid choosing sequences that are so long that they are either at a 
high risk of binding stably to partial complements, or that they cannot easily be dislodged 
15 when desired (e.g., failure to disassociate fiom the target once cleavage has occurred or 
failure to disassociate at a reaction temperature suitable for. the enzymes and other 
matoials in the reaction). 

The first step of design and selection of the oligonucleotides for the INVADER 
oligonucleotide-directed cleavage is m accordance with these sample general principles. 
20 Considered as sequence-specific probes individually, each oUgonucleotide may be 
selected according to the guidelines listed above. That is to say, each oligonucleotide 
will generally be long enough to be reasonably expected to hybridize only to the intended 
target sequence within a complex sample, usually in the 20 to 40 nucleotide rahge. 
Alternatively, because the INVADER oligonucleotide-directed cleavage assay depends 
25 upon the concerted action of these oligonucleotides, the composite length of the 2 

oligonucleotides which span^ind to the target may be selected to fall within this range, 
with each of the individual oligonucleotides being in approximately the 13 to .17 
nucleotide range. Such a design might be employed if a non-thermostable cleavage 
means were employed in the reaction, requiring the reactions to be conducted at a lower 
30 temperature than that used when thermostable cleavage means are employed. In some 
embodiments, it may be desirable to have these oligonucleotides bind multiple times 
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within a single target nucleic acid (e.g., to bind to multiple variants or multiple similar 
sequences within a target). It is not intended that the method of the present invention be 
limited to any particular size of the probe or INVADER oligonucleotide. 

The second step of designing an oligonucleotide pair for this assay is to choose 

5 the degree to which the upstream "INVADER" oligonucleotide sequence will overlap 
into the downstream "probe" oligonucleotide sequence, and consequently, the sizes into 
which the probe will be cleaved. In some preferred embodiments, the probe 
oligonucleotide can be made to "turn over," that is to say probe can be made to depart to 
allow the binding and cleavage of other copies of the probe molecule, without the 

1 0 requirements of thermal denaturation or displacement by polymerization. While in one 
embodiment of this assay probe turnover may be facilitated by an exonucleolytic 
digestion by the cleavage agent, in some preferred embodiments of the present invention 
turnover does not require this exonucleolytic activity. For example, in some 
embodiments, a reaction temperature and reaction conditions are selected so as to create 

15 an equilibrium wherein the probe hybridizes and disassociates from the target. In other 
embodiments, temperature and reaction conditions are selected so that unbound probe can 
initiate binding to the target strand and physically displace bound probe. In still otho- 
embodiments, temperature and reaction conditions are selected such that either or both 
mechanisms of probe replacement may occur in any proportion. The method of the 

20 present invention is not limited to any particular mechanism of probe replacement. By 
any mechanism, when the probe is bound to the target to form a cleavage structure, 
cleavage can occur. The continuous cycling of the probe on and off of the target allows 
multiple probes to bind and be cleaved for each copy of a target nucleic acid. 

i) Non-sequence Overlaps 

25 It has been determined that the relationship between the 3' end of the upstream 

oligonucleotide and the desired site of cleavage on the probe should be carefully 
designed. It is known that the preferred site of cleavage for the types of structure-specific 
endonucleases employed herein is one basepair into a duplex (Lyamichev et al, supra). 
It was previously believed that the presence of an upstream oligonucleotide or primer 

30 allowed the cleavage site to be shifted away from this preferred site, into the single 
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Stranded region of the 5' arm (Lyamichev et ai, supra and U.S. Patent No. 5,422,253). In 
contrast to this previously proposed mechanism, and while not limiting the present 
invention to any particular mechanism, it is believed that the nucleotide immediately 5'. 
or upstream of the cleavage site on the probe (including miniprobe and mid-range probes) 
should be able to basepair with the target for efficient cleavage to occur. In the case of 
the present invention, this would be the nucleotide in the probe sequence immediately 
upstream of the intended cleavage site. In addition, as described herein, it has been 
observed that in order to direct cleavage to that same site in the probe, the upstream 
oUgonucleotide should have its 3' base (i.e.. nt) immediately upstream of the intended 
cleavage site of the probe. In embodiments where the INVADER and probe 
oUgonucleotides share a sequence overlap, this places the 3' terminal nucleotide of the 
upstream oligonucleotide and the base of the probe oligonucleotide 5' of the cleavage site 
in competition for pairing with the corresponding nucleotide of the target strand. 

To examine tiie outcome of this competition (Le. which base is paired during a 
successful cleavage event), substitutions were made in the probe and INVADER 
oligonucleotides such that either tiie probe or the INVADER oligonucleotide were 
mismatched witii tiie target sequence at tiiis position. The effects of both arrangements 
on tiie rates of cleavage were examined. When the INVADER oligonucleotide is 
unpaired at tiie 3' end. tiie rate of cleavage was not reduced. If tiiis base was removed, 
however, tiie cleavage site was shifted upstream of tiie intended site. In contrast, if the 
probe oligonucleotide was not base-paired to tiie target just upstream of tiie site to which 
tiie INVADER oligonucleotide was directing cleavage, tiie rate of cleavage was 
dramatically reduced, suggesting tiiat when a competition exists, flie probe 
oligonucleotide was the molecule to be base-paired in fliis position. 

It appears tiiat tiie 3' end of tiie upstream INVADER oligonucleotide is unpaired 
during cleavage, and yet is important for accurate positioning of tiie cleavage. To 
examine which part(s) of tiie 3' terminal nucleotide are required for tiie positioning of 
cleavage, INVADER oligonucleotides were designed ttiat terminated on tiiis end witii 
nucleotides that were altered in a variety of ways. Sugars examined included 2' 
deoxyribose with a 3' phosphate group, a dideoxyribose, 3' deoxyribose, 2' 0-methyl 
ribose, arabinose and arabinose with a 3' phosphate. Abasic ribose, witii and wittiout 3' 
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phosphate were tested. Synthetic **universar* bases such at 3-mtropynoIe and 5-3 
nitroindole on ribose sugars were tested. Finally, a base-like aromatic ring structure, 
acridine, linked to the 3' end the previous nucleotide without a sugar group was tested. 
The results obtained support the conclusion that the aromatic ring of the base (at the 3* 
5 end of the INVADER oligonuceotide) is an important moiety for accomplishing the 
direction of cleavage to the desired site within the downstream probe. The 3* terminal 
moiety of the DSTVADER oligonucleotide need not be a base that is complementary to the 
target nucleic acid. 

ii) Miniprobes And Mid-Range Probes; 

10 As discussed above, the INVADER oligonucleotide-directed cleavage assay may 

be performed using INVADER and probe oligonucleotides that have a length of about 
13-25 nucleotides (typically 20-25 nucleotides). It is also contemplated that the 
oligonucleotides may themselves be composed of shorter oligonucleotide sequences that 
align along a target strand but that are not covalently linked. This is to say that there is a 

1 5 nick in the sugar-phosphate backbone of the composite oligonucleotide, but that there is 
no disruption in the progression of base-paired nucleotides in the resulting duplex. When 
short strands of nucleic acid align contiguously along a longer strand the hybridization of 
each is stabilized by the hybridization of the neighboring fragments because the basepairs 
can stack along the helix as though the backbone was in fact uninterrupted. This 

20 cooperativity of binding can give each segment a stability of interaction in excess of what 
would be expected for the segment hybridizing to the longer nucleic acid alone. One 
application of this observation has been to assemble primers for DNA sequencing, 
typically about 18 nucleotides long, from sets of three hexamer oligonucleotides that are 
designed to hybridize in this way (Kotler et al Proc. Natl. Acad. Sci. USA 90:4241 

25 [1 993]). The resulting doubly-nicked primer can be extended enzymatically in reactions 
performed at temperatures that might be expected to disrupt the hybridization of 
hexamers, but not of 1 8-mers. 

The use of composite or split oligonucleotides is applied with success in the 
INVADER-directed cleavage assay. For example, the probe oligonucleotide may be split 

30 into two oligonucleotides that anneal in a contiguous and adjacent manner along a target 
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Oligonucleotide such that the downstream oligonucleotide (analogous to the probe) is 
assembled from two smaller pieces: a short segment of 6-10 nts (termed the 
"miniprobe"), that is to be cleaved in the course of the detection reaction, and an 
oligonucleotide that hybridizes immediately downstream of the miniprobe (termed the 
"stacker"), that serves to stabilize the hybridization of the probe. To form the cleavage 
structure, an upstream oligonucleotide (the INVADER oligonucleotide) is provided to 
direct the cleavage activity to the desired region of the miniprobe. Assembly of the probe 
from non-linked pieces of nucleic acid (i.e., the miniprobe and the stacker) allows regions 
of sequences to be changed without requiring the re-synthesis of the entire proven 
sequence, thus improving the cost and flexibility of the detection system. In addition, the 
use of unlinked composite oligonucleotides makes the system more stringent in its 
requirement of perfectly matched hybridization to achieve signal generation, allowing 
this to be used as a sensitive means of detecting mutations or changes in the target nucleic 
acid sequences. 

In one embodiment, the methods of the present invention employ at least three 
oUgonucleotides that interact with a target nucleic acid to form a cleavage structure for a 
structure-specific nuclease. More specifically, the cleavage struchore comprises i) a 
target nucleic acid that may be either single-stranded or double-sti:anded (when a double- 
stianded target nucleic acid is employed, it may be rendered single-stranded. e.g., by 
heating); ii) a first oligonucleotide, termed the "stacker," tiiat defines a first region of tiie 
target nucleic acid sequence by being the complement of that region.; iii) a second 
oligonucleotide, termed the "miniprobe," that defines a second region of the target 
nucleic acid sequence by being tiie complement of that region ; iv) a third 
oligonucleotide, termed the "INVADER oligonucleotide," the 5' part of which defines a 
tiutd region of tiie same target nucleic acid sequence, adjacent to and downstream of the 
second target region, and the second or 3' part of which overiaps into the region defined 
by the second oUgonucleotide As described above for embodiments that do not employ a 
stacker, the overlap region can represent a region where tiiere is a physical, but not 
sequence, overlap between tiie IhfVADER and probe oligonucleotides. 
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In addition to the benefits cited above, the use of a composite design for the 
ohgonucleotides that form the cleavage structure allows more latitude in the design of the 
reaction conditions for performing the INVADER-directed cleavage assay. When a 
longer probe (eg., 16-25 nt), as described above, is used for detection in reactions that 
S are performed at temperatures below the Tm of that probe, the cleavage of the probe may 
play a significant role in destabilizing the duplex of which it is a part, thus allowing 
turnover and reuse of the recognition site on the target nucleic acid. In contrast, reaction 
temperature that are at or above the Tm of the probe mean that the probe molecules are 
hybridizing and releasing from the target quite rapidly even without cleavage of the 

1 0 probe. When an upstream INVADER oligonucleotide and a cleavage agent are provided 
the probe will be specifically cleaved, but the cleavage will not be necessary to the 
turnover of the probe. When a long probe (e.g., 16-25 nt) is used in this way the 
tOTiperatures required to achieve this state is high, around 65 to 70**C for a 25-mer of 
average base composition. Requiring the use of such elevated temperatures limits the 

1 5 choice of cleavage agents to those that are very thermostable, and may contribute to 
background in the reactions, depending of the means of detection, through thermal 
degradation of the probe oligonucleotides. With miniprobes, this latter mechanism of 
probe replacement may be accomplished at a lower temperature. Thus, shorter probes are 
preferred for embodiments using lower reaction temperatures. 

20 The miniprobe of the present invention may vary in size depending on the desired 

application. In one embodiment, the probe may be relatively short compared to a 
standard probe (e.g., 16-25 nt), in the range of 6 to 10 nucleotides. When such a short 
probe is used, reaction conditions can be chosen that prevent hybridization of the 
miniprobe in the absence of the stacker oligonucleotide. In this way a short probe can be 

25 made to assume the statistical specificity and selectivity of a longer sequence. In the 
event of a perturbation in the cooperative binding of the miniprobe and stacker nucleic 
acids, as might be caused by a mismatch within the short sequence that is otherwise 
complementary to the target nucleic acid or at the junction between the contiguous 
duplexes, this cooperativity can be lost, dramatically reducing the stability of the shorter 

30 duplex (/.€., that of the miniprobe), and thus reducing the level of cleaved product in the 
assay of the present invention. 
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It is also contemplated that probes of intermediate size may be used. Such probes, 
in the 1 1 to 1 5 nucleotide range, may blend some of the features associated with the 
longer probes as originally described, these features including the ability to hybridize and 
be cleaved absent the help of a stacker oligonucleotide. At temperatures below the 

5 expected T™ of such probes, the mechanisms of turnover may be as discussed above for 
probes in the 20 nt range, and be dependent on the removal of the sequence in the overlap 
region for destabilization and cycling. 

The mid-range probes may also be used at elevated temperatures, at or above their 
expected to allow melting rather than cleavage to promote probe turnover. In 

10 contrast to the longer probes described above, however, the temperatures required to 

allow the use of such a thermally driven turnover are much lower (about 40 to 60"C). 
thus preserving both the cleavage means and the nucleic acids in the reaction from 
thermal degradatioa In this way. the mid-range probes may perfonn in some instances 
like the miniprobes described above. In a further similarity to the miniprobes, the 
1 5 accumulation of cleavage signal from a mid-range probe may be helped under some 
reaction conditions by the presence of a stacker. 

To summarize, a standard long probe usually does not benefit from the presence 
of a stacker oligonucleotide downstream (the exception being cases where such an 
oligonucleotide may also disrupt structures in the target nucleic acid that interfere with 
20 the probe binding), and it may be used in conditions requiring several nucleotides to be 
removed to allow the oligonucleotide to release from the target efficiently. If temperature 
of the reaction is used to drive exchange of the probes, standard probes may require use 
of a temperature at which nucleic acids and enzymes are at higher risk of thermal 
degradation. 

25 The miniprobe is very short and performs optimally in the presence of a 

downstream stacker oligonucleotide. The miniprobes are well suited to reactions 
conditions that use the temperature of the reaction to drive rapid exchange of the probes 
on the target regardless of whether any bases have been cleaved. In reactions with 
sufficient amount of the cleavage means, the probes that do bind will be rapidly cleaved 

30 before they melt off. 
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The mid-range or midiprobe combines features of these probes and can be used in 
reactions like those favored by long probes, with longer regions of overlapto drive probe 
turnover at lower temperature. In a preferred embodiment, the midrange probes are used 
at temperatures sufficiently high that the probes are hybridizing to the target and 
5 releasing rapidly regardless of cleavage. The mid-range probe may have danced 
performance in the presence of a stacker under some circumstances. 

The distinctions between the mini-, midi- (/.^., mid-range) and long probes are not 
cont^plated to be inflexible and based only on length. The performance of any given 
probe may vary with its specific sequence, the choice of solution conditions, the choice of 
1 0 temperature and the selected cleavage means. 

The assemblage of oligonucleotides that comprises the cleavage structure of the 
presoit invention is sensitive to mismatches between the probe and the target It is also 
contemplated that a mismatch between the INVADER oligonucleotide and the target may 
be used to distinguish related target sequences. In the 3-oligonucleotide system, 
1 5 comprising an INVADER, a probe and a stacker oligonucleotide, it is contemplated that 
mismatches may be located within any of the regions of duplex formed between these 
oligonucleotides and the target sequence. In a preferred embodiment, a mismatch to be 
detected is located in the probe. In a particularly preferred embodiment, the mismatch is 
in the probe, at the basepair immediately upstream (/.e., 5*) of the site that is cleaved 
20 when the probe is not mismatched to the target. 

In another preferred embodiment, a mismatch to be detected is located within the 
region defined by the hybridization of a miniprobe. In a particularly preferred 
embodiment, the mismatch is in the miniprobe, at the basepair immediately upstream 
(i.e,, 5*) of the site that is cleaved when the miniprobe is not mismatched to the target. 

25 iii) Software for Oligonucleotide Design for the INVADER assay 

The present invention provides systems and methods for the design of 
oligonucleotides for use in detection assays. In particular, the present invention provides 
systems and methods for the design of oligonucleotides that successfully hybridize to 
appropriate regions of target nucleic acids (e.g., regions of target nucleic acids that do not 
30 contain secondary structure) under the desired reaction conditions (e.g., temperature, 
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buffer conditions, etc.) for the detection assay. The systems and methods also allow for 
the design of multiple different oligonucleotides (e.g., oligonucleotides that hybridize to 
different portions of a target nucleic acid or that hybridize to two or more different target 
nucleic acids) that all function in the detection assay under the same or substantially the 
5 same reaction conditions. These systems and methods may also be used to design control 
samples that work under the experimental reaction conditions. 

While the systems and methods of the present invention are not limited to any 
particular detection assay, flie following description illustrates the invention when used in 
conjunction with the INVADER assay (Third Wave Technologies. Madison WI; See e.g., 
10 U.S. Pat. Nos. 5,846.717, 5,985.557, 5.994,069, and 6.001.567 and PCT Publications WO 
97/27214 and WO 98/42873, incorporated herein by reference in their entireties) to detect 
a SNP. One skilled in the art will appreciate that specific and general features of this 
illustrative example are generally appUcable to other detection assays, and for use in 
designing INVADER assays for purposes other tiian SNP detection (eg., for DNA or 
15 RNA quantitation, for RNA splice junction detection, etc.). Further, it will be 

appreciated that all algorithms described herein can be applied as separate software 
elemoits, or calculations may be performed manually, for the deagn of any INVADER 
assay probe set without use of the INVADERCREATOR design system described below. 
Oligonucleotide Design for the INVADER assay using the 
20 INVADERCREATOR program 

In some embodiraraits where an oligonucleotide is designed for use in the 
INVADER assay to detect a SNP, the sequence(s) of interest are entered into the 
INVADERCREATOR program (Third Wave Technologies, Madison, WI). As described 
above, sequences may be input for analysis from any number of sources, either directly 
25 into the computer hosting the INVADERCREATOR program, or via a remote computer 
linked through a communication network (eg., a LAN. Intranet or Internet network). 
The program designs probes for both the sense and antisaise strand. Strand selection is 
generally based upon the ease of synthesis, minimization of secondary structure 
formation, and manufacturability. In some embodiments, the user chooses the strand for 
30 sequences to be designed for. In other embodiments, the software automatically selects 
fee strand. By incorporating thermodynamic parameters for optimum probe cycling and 
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signal generation (AUawi and SantaLucia, Biochemistry, 36:10581 [1997]), 
oligonucleotide probes may be designed to operate at a pre-selected assay temperature 
(e.g., 63^C). Based on these criteria, a final probe set primary probes for 2 alleles 
and an INVADER oligonucleotide) is selected. 
5 In some embodiments, the INVADERCREATOR system is a wd>-based program 

with secure site access that contains a link to BLAST (available at the National Center for 
Biotechnology Information, National Library of Medicine, National Institutes of Health 
web site) and that can be linked to RNAstructure (Mathews et al, RNA 5:1458 [1999]), a 
software program that incorporates mfold (Zuker, Science, 244:48 [1989]). 

1 0 RNAstructure tests the proposed oligonucleotide designs generated by 

INVADERCREATOR for potential uni- and bimolecular complex formation. 
INVADERCREATOR is open database connectivity (ODBC)-compliant and uses the 
Oracle database for export/integration. The INVADERCREATOR systan was 
configured with Oracle to work well with UNIX systems, as most genome centers are 

15 UNIX-based. 

In some embodiments, the INVADERCREATOR analysis is provided on a 
separate server (eg., a Sun server) so it can handle analysis of large batch jobs. For 
example, a customer can submit up to 2,000 SNP sequences in one email The server 
passes the batch of sequences on to the INVADERCREATOR software, and, when 

20 initiated, (he program designs SNP sets. In some embodiments, probe set designs are 
returned to the user within 24 hours of receipt of the sequences. 

In some preferred embodiments, each INVADER assay reaction includes at least 
two target sequence-specific, unlabeled oligonucleotides for the primary reaction: an 
upstream INVADER oligonucleotide and a doAvnstream Probe oligonucleotide. The 

25 INVADER oligonucleotide is generally designed to bind stably at the reaction 

temperature, while the probe is designed to finely associate and disassociate with the 
target strand, with cleavage occurring only when an uncut probe hybridizes adjacent to an 
overlapping INVADER oligonucleotide. In some embodiments, the probe includes a 5' 
flap or "arm" that is not complementary to the target, and this flap is released fi-om the 

30 probe when cleavage occurs. In some embodiments, the released flap participates as an 
INVADER oligonucleotide in a secondary reaction. 
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The following discussion provides one example of how a user interface for an 
INVADERCREATOR program may be configured. 

The user opens a wbrk screen (Figure 42). e.g., by clicking on an icon on a 
desktopdisplayofacomputer(e.g.. a Windows desktop). The user enters information 
related to the target sequence for which an assay is to be designed. In some 
embodiments, the user enters a target sequence. In other embodiments, the user enters a 
code or number that causes retrieval of a sequence from a database. In still other 
embodiments, additional information may be provided, such as the user's name, an 
identifying number associated with a target sequence, and/or an order number. In 
preferred embodiments, the user indicates (e.g. via a checkbox or drop down menu) that 
the target nucleic acid is DNA or RNA. In other preferred embodiments, the user 
indicates the species from which the nucleic acid is derived. In particularly preferred 
embodiments, the user indicates whether the design is for monoplex (i.e., one target 
sequence or allele per reaction) or multiplex multiple target sequences or alleles per 
reaction) detection. When the requisite choices and entries are complete, the user starts 
the analysis process. In one embodiment, the user clicks a "Go Design It" button to 
continue. 

In some embodiments, the software validates the field entries before proceeding. 
In some embodiments, the software verifies that any required fields are completed with 
the appropriate type of information. In other embodiments, the software verifies that the 
input sequence meets selected requirements (e.g.. minimum or maximum length. DNA or 
RNA content). If entries in any field are not found to be valid, an error message or dialog 
box may appear. In preferred embodiments, the error message indicates which field is 
incomplete and/or incorrect. Once a sequence entry is verified, the software proceeds 

Willi the assay design. 

In some embodiments, the infomiation supplied in the order entry fields specifies 
what type of design will be created. In prefened embodiments, the target sequence and 
multiplex check box specify which type of design to create. Design options include but 
are not limited to SNP assay. Multiplexed SNP assay (e.g., wherein probe sets for 
different alleles are to be combined in a single reaction). Multiple SNP assay (.e.g.. 
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wherein an input sequence has multiple sites of variation for which probe sets are to be 
designed), and Multiple Probe Arm assays. 

In some embodiments, the INVADERCREATOR software is started via a Web 
Order Entry (WebOE) process through an Intra/Internet browser interface) and these 

5 parameters are transferred from the WebOE via applet <param> tags, rather than entered 
through menus or check boxes. 

In the case of Multiple SNP Designs, the user chooses two or more designs to 
woA with. In some embodiments, this selection opens a new screen view (e.^., a 
Multiple SNP Design Selection view Figure 43). In some embodiments, the software 

10 creates designs for each locus in the target sequence, scoring each, and presents them to 
the user in this screen view. The user can then choose any two designs to work with. In 
some embodiments, the user chooses a first and second design (eg., via a menu or 
buttons) and clicks a "Go Design It" button to continue. 

To select a probe sequence that will perform optimally at a pre-selected reaction 

1 5 temperature, the melting temperature (TnO of the SNP to be detected is calculated using 
the nearest-neighbor model and published parameters for DNA duplex formation (Allawi 
and SantaLucia, Biochemistry, 36:10581 [1997]). In embodiments wherein the target 
strand is RNA, parameters appropriate for RNA/DNA heteroduplex formation may be 
used. Because the assay's salt concentrations are often different than the solution 

20 conditions in which the nearest-neighbor parameters were obtained (IM NaCl and no 
divalent metals), and because the presence and concentration of the enzyme influence 
optimal reaction temperature, an adjustment should be made to the calculated Tm to 
determine the optimal temperature at which to perform a reaction. One way of 
compensating for these factors is to vary the value provided for the salt concentration 

25 within the melting temperature calculations. This adjustment is termed a 'salt conection'. 
As used herein, th^ term "salt correction" refers to a variation made in the value provided 
for a salt concentration for the purpose of reflecting the effect on a Ti,, calculation for a 
nucleic acid duplex of a non-salt parameter or condition affecting said duplex. Variation 
of the values provided for the strand concentrations will also affect the outcome of these 

30 calculations. By using a value of 0.5 M NaCl (SantaLucia, Proc Natl Acad Sci U S A, 
95:1460 [1998]) and strand concentrations of about 1 mM of the probe and 1 fM target, 
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the algorithm used for calculating probe-target melting temperature has been adapted for 
use in predicting optimal INVADER assay reaction temperature. For a set of 30 probes, 
the average deviation between optimal assay temperatures calculated by this method and 
those e:q)erimentally determined is about 1.5 "C. 
5 The length of the downstream probe analyte-specific region (ASR) is defined by 

the temperature selected for rumiing the reaction (eg., 63°C). Starting from the position 
of the variant nucleotide on the target DNA (the target base that is paired to the probe 
nucleotide 5' of the intended cleavage site), and adding on the 3' end, an iterative 
procedure is used by which the length of the target-binding region of the probe is 
10 increased by one base pair at a time until a calculated optimal reaction temperature (Tm 
plus salt correction to compensate for enzyme effect) matching the desired reaction 
temperature is reached. The non-complementary arm of the probe is preferably selected 
to allow the secondary reaction to cycle at the same reaction temperature. The entire 
probe oUgonucleotide is screened using programs such as mfoM (Zuker, Science, 244: 48 
15 [19891) or Oligo 5.0 (Rychlik and Rhoads, Nucleic Acids Res. 17: 8543 [1 989]) for the 
possible formation of dimer complexes or secondary structures that could interfere with 
the reaction. The same principles are also followed for INVADER oligonucleotide 
design. Briefly, starting from the position N on the target DNA. the 3' end of the 
INVADER oligonucleotide is designed to have a nucleotide not complementary to either 
20 allelesuspectedofbeingcontainedinthesampletobetested. The mismatch does not 
adversely affect cleavage (Lyamichev et al. Nature Biotechnology, 17: 292 [1999]). and 
it can enhance probe cycling, presumably by minimizing coaxial stabilization effects 
between the two probes. Additional residues complementary to the target DNA starting 
from residue N-l are then added in the 5' direction until the stability of the 
25 INVADERoligonucleotide-target hybrid exceeds that of the probe (and therefore the 
planned assay reaction temperature), generally by 15-20 °C. 

In some embodiments, the released cleavage fragment from a primary.reaction is 
to be used in a secondary reaction. It is one aspect of the assay design that the all of the 
probe sequences may be selected to allow the primary and secondary reactions to occur at 
30 the same optimal temperature, so that the reaction steps can run simultaneously. In an 
alternative embodiment, the probes may be designed to operate at different optimal 
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temperatures, so that the reaction stq)s are not simultaneoxisly at their temperature 
optima. 

In some embodiments, the software provides the user an opportimity to change 
various aspects of the design including but not limited to: probe, target and INVADER 

5 oligonucleotide temperature optima and concentrations; blocking groups; probe arms; 
dyes, capping groups and other adducts; individual bases of the probes and targets (e.g., 
adding or deleting bases from the end of targets and/or probes, or changing internal bases 
in the INVADER and/or probe and/or target oligonucleotides). In some embodiments, 
changes are made by selection from a menu. In other embodiments, changes are entered 

10 into text or dialog boxes. In preferred embodiments, this option opens a new screen (e.g., 
a Designer Worksheet view, Figure 44). 

In some embodiments, the software provides a scoring system to indicate the 
quality (e.g., the likelihood of performance) of the assay designs. In one embodiment, 
the scoring system includes a starting score of points (eg., 100 points) wherein the 

1 5 starting score is indicative of an ideal design, and wherein design features known or 
suspected to have an adverse affect on assay performance are assigned penalty values. 
Penalty values may vary depending on assay parameters other than the sequences, 
including but not limited to the type of assay for which the design is intended (e.g., 
monoplex, multiplex) and the temperature at which the assay reaction will be perfonned. 

20 The following example provides illustrative scoring criteria for use with some 
embodiments of the INVADER assay based on an intelligence defined by 
experimentation. Examples of design features that may incur score penalties include but 
are not limited to the following [penalty values are indicated in brackets, first number is 
for lower temperature assays (eg., 62-64 °C), second is for higher temperature assays 

25 (eg., 65-66 °C)]: 

1. [100:100] 3* end of INVADER oligonucleotide resembles the probe arm: . 
ARM SEQUENCE: PENALTY AWARDED IF INVADER 

OLIGONUCLEOTIDE ENDS IN: 
30 Arml: CGCGCCGAGG 5*...GAGGX or 5\..GAGGXX 

Arm 2: ATGACGTGGCAGAC 5'...CAGACXor 5\..CAGACXX 
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Arm3: ACGGACGCGGAG 5'...GGAGX or 5'...GGAGXX 

Arm4: TCCGCGCGTCC 5'...GTCCX or 5'...GTCCXX 

2. [70:701 a probe has 5-base stretch (i.e.. 5 of the same base in a row) containing 
the polymorphism; 

3. [60:60] a probe has 5-base stretch adjacent to the polymorphism; 

4. [50:50] a probe has 5-base stretch one base from the polymorphism; 

5. [40:40] a probe has 5-base stretch two bases from the polymorphism; 

6. [50:50] probe 5-base stretch is of Gs - additional penalty; 

7. [100:100] a probe has 6-base stretch anywhere; 

8. [90:90] a two or three base sequence repeats at least four times; 

9. [1 00: 1 00] a degenerate base occurs in a probe; 

10. [60:90] probe hybridizing region is short (13 bases or less for designs es-e?^; 12 
bases or less for designs 62-64°Q 

1 1. [40:90] probe hybridizing region is long (29 bases or more for designs 65-67°C, 28 
bases or more for designs 62-64*'C) 

12. [5:5] probe hybridizing region length - per base additional penalty 

13. [80:80] Ins/Del design with poor discrimination in first 3 bases afler probe arm 

14. [100:100] calculated INVADER oligonucleotide Tm within 7.5°C of probe target 
Tm (designs 65-67''C with INVADER oligonucleotide less than < 70.5'C, designs 62- 
64*'C wifli INVADER oligonucleotide < 69.5°C 

15. [20:20] calculated probes Tms diffo- by more than 2.0°C 

16. [100:100] a probe has calculated Tm 2''C less than its target Tm 

17. [10:10] target of one strand 8 bases longer than that of other strand 

18. [30:30] INVADER oligonucleotide has 6-base stretch anywhere - initial penalty 

19. [70:70] INVADER oligonucleotide 6-base stretch is of Gs - additional penalty 

20. [15:15] probe hybridizing region is 14, 15 or 24-28 bases long (65-6700 or 13.14 
or 26,27 bases long (62-64«'C) 

21. [15:15] a probe has a 4-base stretch ofGs containing the polymorphism 

In particularly pref^ed embodiments, temperatures for each of the 
oligonucleotides in the designs are recomputed and scores are recomputed as changes are 



89. 



wo 01/90337 



PCTAJSOl/17086 



made. In some embodiments, score descriptions can be seen by clicking a "descriptions" 
button. In some embodiments, a BLAST search option is provided. In preferred 
embodiments, a BLAST search is done by clicking a "BLAST Design" button. In some 
embodiments, this action brings up a dialog box describing the BLAST process. In 

5 preferred embodiments, the BLAST search results are displayed as a highlighted design 
on a Designer Worksheet. 

In some embodiments, a user accepts a design by clicking an "Accept" button. In 
other embodiments, the program approves a design without user intervention. In 
preferred embodiments, the program sends the approved design to a next process step 

10 (e.g., into production; into a file or database). In some embodiments, the program 
provides a screen view an Output Page, Figure 45), allowing review of the final 
designs created and allowing notes to be attached to the design. In preferred 
embodiments, the user can return to the Designer Worksheet by clicking a "Go 
Back" button) or can save the design (^.g., by clicking a "Save It" button) and continue 

1 S {e.g. , to submit the designed oligonucleotides for production). 

In some embodiments, the program provides an option to create a screen view of 
a design optimized for printing a text-only view) or other export an Output 
view. Figure 46). In preferred embodiments, the Output view provides a description of 
the design particularly suitable for printing, or for exporting into another application (e.g., 

20 by copying and pasting into another application). In particularly preferred embodiments, 
the Output view opens in a separate window. 

The present invention is not limited to the use of the INVADERCREATOR 
software. Indeed, a variety of software programs are contemplated and are commercially 
available, including, but not limited to GCG Wisconsin Package (Genetics computer 

25 Group, Madison, WI) and Vector NTI (Informax, Rockville, Maryland). 

b) Design Of The Reaction Conditions 

Target nucleic acids (e.g., RNA and DNA) that may be analyzed using the 
methods of the present invention that employ a 5* nuclease or other appropriate cleavage 
agents. Such nucleic acids may be obtained using standard molecular biological 
30 techniques. For example, nucleic acids (RNA or DNA) may be isolated fix>m a tissue 
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sample (e.g.. a biopsy specimen), tissue culture cells, samples containing bacteria and/or 
viruses (including cultures of bacteria and/or viruses), etc. The target nucleic acid may 
also be transcribed in vitro from a DNA template or may be chemically synthesized or 
amplified in by polymerase chain reaction. Furthermore, nucleic acids may be isolated 
5 from an organism, either as genomic material or as a plasmid or similar 

extrachromosomal DNA. or they may be a fragment of such material generated by 
treatment with a restriction endonuclease or other cleavage agent, or a shearing force, or 
it may be synthetic. 

Assembly of the target, probe, and INVADER oligonucleotide nucleic acids into 
10 the cleavage reaction of the present invention uses principles commonly used in the 

design of oligonucleotide-based enzymatic assays, such as dideoxynucleotide sequencing 
and polymerase chain reaction (PGR). As is done in these assays, the oligonucleotides 
are provided in sufficient excess that the rate of hybridization to the target nucleic acid is 
very rapid. These assays are commonly performed with 50 finoles to 2 pmoles of each 
15 oligonucleotide per microliter of reaction mixture, although they are not necessarily 
limited to this range. In the Examples described herein, amounts of oligonucleotides 
ranging from 250 finoles to 5 pmoles per microliter of reaction volume were used. These 
values were chosen for the purpose of ease in demonstration and are not intended to limit 
the performance of the present invention to these concentrations. Other {e.g., lower) 
20 oUgonucleotide concentrations commonly used in other molecular biological reactions 

are also contemplated. 

It is desirable that an INVADER oligonucleotide be immediately available to 
direct the cleavage of each probe oligonucleotide that hybridizes to a target nucleic acid. 
In some embodiments described herein, the INVADER oligonucleotide is provided in 

25 excess over the probe oligonucleotide. While this is an effective means of making the 
INVADER oligonucleotide immediately available in such embodiments it is not intended 
that the practice of the present invention be limited to conditions wherein the INVADER 
oligonucleotide is in excess over the probe, or to any particular ratio of INVADER-to- 
probe (e.g., in some preferred embodiments described herein, the probe is provided in 

30 excess over the INVADER oligonucleotide). Another means of assuring the presence of 
an INVADER oligonucleotide whenever a probe binds to a target nucleic acid is to 
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design the INVADER oligonucleotide to hybridize more stably to the target, i.e., to have 
a higher Tm than the probe. This can be accomplished by any of the means of increasing 
nucleic acid duplex stability discussed herein {e,g., by increasing the amount of 
complementarity to the target nucleic acid). 

5 Buffer conditions should be chosen that are compatible with both the 

oligonucleotide/target hybridization and with the activity of the cleavage agent. The 
optimal buffer conditions for nucleic acid modification enzymes, and particularly DNA 
modification enzymes, generally included enough mono- and di-valent salts to allow 
association of nucleic acid strands by base-pairing. If the method of the present invention 

1 0 is performed using an enzymatic cleavage agent other than those specifically described 
here, the reactions may generally be performed in any such buffer reported to be optimal 
for the nuclease function of the cleavage agent. In general, to test the utility of any 
cleavage ag^t in this method, test reactions are performed wherein the cleavage agent of 
interest is tested in the M0PS/MnCl2/KCl buffer or Mg-containing buffers described 

1 5 herein and in whatever buffer has been reported to be suitable for use with that agent, in a 
manufacturer's data sheet, a journal article, or in personal communication. 

The products of the INVADER oligonucleotide-directed cleavage reaction are 
fragments generated by structure-specific cleavage of the input oligonucleotides. The 
resulting cleaved and/or uncleaved oligonucleotides may be analyzed and resolved by a 

20 number of methods including, but not limited to, electrophoresis (on a variety of supports 
including acrylamide or agarose gels, paper, etc.), chromatography, fluorescence 
polarization, mass spectrometry and chip hybridization. In some Examples the invention 
is illustrated using electrophoretic separation for the analysis of the products of the 
cleavage reactions. However, it is noted that the resolution of the cleavage products is 

25 not limited to electrophoresis. Electrophoresis is chosen to illustrate the method of the 
invention because electrophoresis is widely practiced in the art and is easily accessible to 
the average practitioner. In other Examples, the invention is illustrated without 
electrophoresis or any other resolution of the cleavage products. 

The probe and INVADER oligonucleotides may contain a label to aid in their 

30 detection following the cleavage reaction. The label may be a radioisotope (e.g., a ^^P or 
^^S-labelled nucleotide) placed at either the 5' or 3* end of the oligonucleotide or 
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alternatively, the label may be distributed throughout the oligonucleotide (i.e.. a 
uniformly labeled oligonucleotide). The label may be a nonisotopic detectable moiety, 
such as a fluorophore, that can be detected directly, or a reactive group that permits 
specific recognition by a secondary agent. For example, biotinylated oligonucleotides 
may be detected by probing with a streptavidin molecule that is coupled to an indicator 
(e.g., alkaline phosphatase or a fluorophore) or a hapten such as dioxigenin may be 
detected using a specific antibody coupled to a similar indicator. The reactive group may 
also be a specific configuration or sequence of nucleotides that can bind or otherwise 
interact with a secondary agent, such as another niicleic acid, and enzyme, or an antibody. 
In some embodiments, a probe is labeled with fluorescing moiety and a quenching 
moiety, wherein cleavage of the cleavage structure separates the fluorescing moiety torn 
the quenching moiety, resulting in a detectable signal (e.g., FRET detection). In some 
embodiments, a change in quenching of signal from a donor fluorophor is detected, while 
in other embodiments, a change in emission fit)m an acceptor fluorophore is detected. In 
still other embodiments, the effect of FRET on both donor and acceptor emissions are 
detected. 

In some embodiments of FRET detection, the fluorescence lifetime of the fluorescence 
emitter is measured (,e.g., as in time-resolved fluorescence). While not limiting time- 
resolved fluorescence detection embodiments to any particular labeling systems, 
examples of tags that are usefiil in time-resolved FRET measurements include europium 
chelate (Eu'*; Biosclair. et ai, J. Biomolecular Screening 5(5):319 [2000]), europium 
trisbipyridine ciyptate (TBPEu'*; Alpha-Bazin, et ai. Anal. Biochem. 286(1):17 [2000]), 
and ruthenium ligand complex {[Ru{bpy)2(phen-ITC)]^*; Youn, et al. Anal. Biochem. 
232(1):24 [1995]; Lakowicz, et al.. Anal. Biochem. 288:62 [2001]). 

c) Optimization Of Reaction Conditions 

The INVADER oligonucleotide-directed cleavage reaction is useful to detect the 
presence of specific nucleic acids. In addition to the considerations listed above for the 
selection and design of the INVADER and probe oligonucleotides, the conditions under 
which the reaction is to be perfomied may be optimized for detection of a specific target 
sequence. 
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One objective in optimizing the INVADER oligonucleotide-directed cleavage 
assay is to allow specific detection of the fewest copies of a target nucleic acid. To 
achieve this end, it is desirable that the combined elements of the reaction interact with 
the maximum efficiency, so that the rate of the reaction (e.g., the number of cleavage 

5 events per minute) is maximized. Elements contributing to the overall efficiency of the 
reaction include the rate of hybridization, the rate of cleavage, and the eflSciency of the 
release of the cleaved probe. 

The rate of cleavage will be a function of the cleavage means chosen, and may be 
made optimal according to the manufacturer's instructions when using commercial 

1 0 preparations of enzymes or as described in the examples herein. The other elements (rate 
of hybridization, efficiency of release) depend upon the execution of the reaction, and 
optimization of these elements is discussed below. 

Three elements of the cleavage reaction that significantly affect the rate of nucleic 
acid hybridization are the concentration of the nucleic acids, the temperature at which the 

IS cleavage reaction is performed and the concentration of salts and/or other charge- 
shielding ions in the reaction solution. 

The concentrations at which oligonucleotide probes are used in assays of this type 
are well known in the art, and are discussed above. One example of a common approach 
to optimizing an oligonucleotide concentration is to choose a starting amount of 

20 oligonucleotide for pilot tests; 0.01 to 2 ^ is a concentration range used in many 
oligonucleotide-based assays. When initial cleavage reactions are performed, the 
following questions may be asked of the data: Is the reaction performed in the absence of 
the target nucleic acid substantially fi*ee of the cleavage product?; Is the isite of cleavage 
specifically positioned in accordance with the design of the INVADER oligonucleotide?; 

25 Is the specific cleavage product easily detected in the presence of the uncleaved probe (or 
is the amount of uncut material overwhelming the chosen visualization method)? 

A negative answer to any of these questions would suggest that the probe 
concentration is too high, and that a set of reactions using serial dilutions of the probe 
should be performed until the appropriate amount is identified. Once identified for a 

30 given target nucleic acid in a give sample type (e.g., purified genomic DNA, body fluid 
extract, lysed bacterial extract), it should not need to be re-optimized. The sample type is 
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important because the complexity of the material present may influence the probe 

conc^tration optimum. 

Conversely, if the chosen initial probe concentration is too low. the reaction may 
be slow, due to inefficient hybridization. Tests with increasing quantities of the probe 
will identify the point at which the concentration exceeds the optimum {e.g., at which it 
produces an undesirable effect, such as background cleavage not dependent on the target 
sequence, or interference with detection of the cleaved products). Since the hybridization 
will be facUitated by excess of probe, it is desirable, but not required, that the reaction be 
performed using probe concentrations just below this point. 

The concentration of INVADER oligonucleotide can be chosen based on the 
design considerations discussed above. In some embodiments, the INVADER 
oligonucleotide is in excess of the probe oligonucleotide. In a preferred embodiment, the 
piobe oUgonucleotide is in excess of the INVADER oligonucleotide. 

Temperature is also an important factor in the hybridization of oligonucleotides. 
The range of temperature tested will depend in large part on the design of the 
oligonucleotides, as discussed above. Where it is desired to have a reaction be run at a 
particular temperature (e.g., because of an enzyme requirement, for convenience, for 
compatibility with assay or detection apparatuses, etc.). the oligonucleotides that function 
in the reaction can be designed to optimally perform at the desired reaction temperature. 
Each INVADER reaction includes at least two target sequence-specific oligonucleotides 
for tiie primary ration: an upstream INVADER oligonucleotide and a downstream 
probe oUgonucleotide. In some preferred embodiments, tiie INVADER oligonucleotide 
is designed to bind stabily at the reaction temperattire. while the probe is designed to 
finely associate and disassociate with the target strand, with cleavage occurring only 
when an uncut probe hybridizes adjacent to an overlapping INVADER oligonucleotide. 
In preferred embodiments, the probe includes a 5' flap that is not complementary to the 
target, and tiiis flap is released from the probe when cleavage occurs. The released flap 
can be detected directly or indirectly. In some preferred embodiments, as discussed in 
detail below, tiie released flap participate as in INVADER oligonucleotide in a secondary 
reaction. 
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Optimum conditions for the INTVADER assay are generally those that allow 
specific detection of the smallest amount of a target nucleic acid. Such conditions may 
be characterized as those that yield the highest target-dependent signal in a given 
timeframe, or for a given amount of target nucleic acid, or that allow the highest rate of 
5 probe cleavage (Le,, probes cleaved per minute). 

As noted above, the concentration of the cleavage agent can affect the actual 
optimum temperature for a cleavage reaction. Additionally, different cleavage agents, 
even if used at identical concentrations, can affect reaction temperature optima differently 
(eg., the difference between the calculated probe Tm and the observed optimal reaction 

10 temperature may be greater for one enzyme than for another). Determination of 

appropriate salt conections for reactions using different enzymes or concentrations of 
enzymes, or for any other variation made in reaction conditions, involves a two step 
process of a) measuring reaction temperature optima under the new reaction conditions, 
and varying the salt concentration withm the Tm algorithm to produce a calculated 

1 5 temperature matching or closely approximating the observed optima. Measurement of an 
optimum reaction temperature generally involves performing reactions at a range of 
temperatures selected such that the range allows observation of an increase in 
performance as an optimal temperature is approached (either by increasing or decreasing 
temperatures), and a decrease in performance when an optimal temperature has been 

20 passed, thereby allowing identification of the optimal temperature or temperature range 
(See e.g., Lyamichev, et al.. Biochemistry 39: 9523 [2000]). 

In some embodiments, a secondary reaction is used where the released cleavage 
fragment from a primary reaction hybridizes to a synthetic cassette to form a secondary 
cleavage reaction. In some preferred embodiments, the cassette comprises a fluorescing 

25 moiety and a quenching moiety, wherein cleavage of the secondary cleavage structure 
separates the fluorescing moiety from the quenching moiety, resulting in a detectable 
signal (e.g., FRET detection). The secondary reaction can be configured a number of 
different ways. For example, in some embodiments, the synthetic cassette comprises two 
oligonucleotides: an oligonucleotide that contains the FRET moieties and a 

30 FRET/INVADER oligonucleotide bridging ohgonucleotide that allows the INVADER 
oligonucleotide (i.e., the released flap from the primary reaction) and the FRET 
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oUgonucleotide to hybridize thereto, such that a cleavage structure is formed. In some 
embodiments, the synthetic cassette is provided as a single oUgonucleotide, comprising a 
hairpin stmcture (i.e.. the FRET oligonucleotide is connected at its 3' end to the bridging 
oUgonucleotide by a loop). The loop may be nucleic acid, or a non-nucleic acid spacer or 
linker. The linked molecules may together be described as a FRET cassette. In the 
secondary reaction using a FRET cassette the released flap from the primary reaction, 
which acts as an INVADER oUgonucleotide, should be able to associate and disassociate 
with the FRET cassette freely, so that one released flap can direct the cleavage of 
multiple FRET cassettes. It is one aspect of the assay design that all of the probe 
sequences may be selected to aUow the primary and secondary reactions to occur at the 
same optimal temperature, so that the reaction steps can run simultaneously. In an 
alternative embodiment, the probes may be designed to operate at different optimal 
temperatures, so that the reaction steps are not simultaneously at their temperature 
optima. As noted above, the same iterative process used to select the ASR of the probe 
can be used in the design of the portion of the primary probe that participates in a 
secondary reaction. 

Another detwminant of hybridization efficiency is the salt concentration of the 
reaction. In large part, the choice of solution conditions wiU depend on the requirements 
of the cleavage agent, and for reagents obtained commercially, the manufactiurer-s 
inslructions are a resource for this information. When developing an assay utilizing any 
particular cleavage agent, the oligonucleotide and temperature optimizations described 
above should be performed in the buffer conditions best suited to that cleavage agent. 

In some embodiments, additional agents may be included in reaction mixtures to 
enhance assay performance. For example, charged compounds such as aminoglycosides 
and other polyaraines have been used to modulate DNA and RNA conformation and 
fimction (see, e.g., Eamshaw and Gait, Nucl. Acids Res. 26:5551 [1998]; Robinson and 
Wang, Nucl. Acids Res. 24:676 [1996]; Jerinie, J. Mol. Biol. 304(5):707 [2000]; 
Schroeder et ai, EMBO 19(1):1 [2000]). Inclusion of the aminoglycoside antibiotic 
neomycin sulfate (e.g., at 1 in a primary reaction) can enhance assay performance by, 
e.g., reducing background signal, and tiierefore reducing the limit of detection of a 
particular INVADER assay probe set. Compounds of this type that may find use in 
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INVADER assay reactions include, but are not limited to, aminoglycosides, oligomerized 
aminoglycosides, and aminoglycoside bioconjugates, and other polyanions including, but 
not limited to, spermine and hexaamine cobalt. 

A "no enzyme" control allows the assessment of the stability of the labeled 
5 oligonucleotides under particular reaction conditions, or in the presence of the sample to 
be tested eg., in assessing the sample for contaminatmg nucleases). In this manner, the 
substrate and oligonucleotides are placed in a tube containing all reaction components, 
except the enzyme and treated the same as the enzyme-containing reactions* Other 
controls may also be included. For example, a reaction with all of the components except 
10 the target nucleic acid will serve to confirm the dependence of the cleavage on the 
presrace of the target sequence. 

d) Selection of a Cleavage Agent 

As demonstrated in a number of the Examples, some 5* nucleases do not require 
an upstream oligonucleotide to be active in a cleavage reaction. Although cleavage may 

15 be slower without the upstream oligonucleotide, it may still occm* (Lyamichev et al. 
Science 260:778 [1993], Kaiser et aL, J. Biol. Chem., 274:21387 [1999]). When aDNA 
strand is the template or target strand to which probe oligonucleotides are hybridized, the 
5* nucleases derived firom DNA polymerases and some flap endonucleases (FENs), such 
as that from Methanococcus jannaschii, can cleave quite well without an upstream 

20 oligonucleotide providing an overlap (Lyamichev et al, Science 260:778 [1993], Kaiser 
et al., J. Biol. Chem., 274:21387 [1999], and US Patent No, 5,843,669, herein 
incorporated by reference in its entirety). These nucleases may be selected for use in 
some embodiments of the INVADER assay, e.g., in embodiments wherein cleavage of 
the probe in the absence of an INVADER oligonucleotide gives a different cleavage 

25 product, which does not interfere with the intended analysis, or wherein both types of 
cleavage, INVADER oligonucleotide-directed and INVADER oligonucleotide- 
independent, are intended to occur. 

In other embodiments it is preferred that cleavage of the probe be dependent on 
the presence of an upstream INVADER oligonucleotide, and enzyme having this 

30 requirement would be used. Other FENs, such as those from Archeaoglobus fulgidus 
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(Afu) and Pyrococcus Juriosus (Pfu), cleave an overlapped structure on a DNA target at 
so much greater a rate than liiey do a non-overlapping structure (i.e., either missing the 
upstream oligonucleotide or having a non-overlapping upstream oligonucleotide) that 
they can be viewed as having an essentially absolute requirement for the overlap 

5 (Lyamichev et al., Nat. Biotechnol, 17:292 [1999], Kaiser et al.. J. Biol. Chem., 

274:21387 [1999]). When an RNA target is hybridized to DNA oligonucleotide probes 
to form a cleavage structure, many FENs cleave the downstream DNA probe poorly, 
regardless of the presence of an overlap. On such an KNA-containing structure, the 5' 
nucleases derived from DNA polymerases have a strong requirement for the overlap, and 

10 are essentially inactive in its absence. The selection of enymes for use in the detection of 
RNA targets is discussed in more detail below, in Section IV: Improved Enzymes For 
Use In INVADER Oligonucleotide-Directed Cleavage Reactions Comprising RNA 
Targets. 

e) Probing For Multiple Alleles 

1 5 The INVADER oligonucleotide-directed cleavage reaction is also useful in the 

detection and quantification of individual variants or alleles in a mixed sample 
population. By way of example, such a need exists in the analysis of tumor material for 
mutations in genes associated with cancers. Biopsy material fifom a tumor can have a 
significant complement of nomial cells, so it is desirable to detect mutations even vdjen 

20 present in fewer than 5% of the copies of the target nucleic acid in a sample. In this case, 
it is also desirable to measure what fraction of the population carries the mutation. 
Similar analyses may also be done to examine allehc variation in other gene systems, and 
it is not intended that the method of the present invention by limited to the analysis of 
tumors. 



25 



As demonstrated below, in one embodiment, reactions can be performed under 
conditions that prevent the cleavage of probes bearing even a single-nucleotide difference 
mismatch, but that permit cleavage of a similar probe that is completely complementary 
to the target in this region. In a preferred embodiment, a mismatch is positioned at the 
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nucleotide in the probe that is 5* of the site where cleavage occurs in the absence of the 
mismatch. 

In other embodiments, the INVADER assay may be performed under conditions 
that have a tight requirement for an overlap , using the Afii FEN for DNA target 
5 detection or the 5' nuclease of DNA polymerase for RNA target detection, as described 
above), providing an alternative means of detecting single nucleotide or other sequence 
variations. In one embodiment, the probe is selected such that the target base suspected 
of vaiying is positioned at the 5' end of the target-complOTientaiy region of this probe. 
The upstream INVADER oligonucleotide is positioned to provide a single base of 

1 0 overly. If the target and the probe oligonucleotide are complementary at the base in 
question, the overlap forms and cleavage can occur. However, if the target does not 
complement the probe at this position, that base in the probe becomes part of a non- 
complementary 5* arm, no overlap between the INVADER oligonucleotide and probe 
oligonucleotide exists, and cleavage is suppressed. 

IS It is also contemplated that different sequences may be detected in a single 

reactiorL Probes specific for the different sequences may be differently labeled. For 
example, the probes may have different dyes or other detectable moieties, different 
lengths, or they may have differences in net charges of the products after cleavage. When 
diff^^tly labeled in one of these ways, the contribution of each specific target sequence 

20 to final product can be tallied. This has application in detecting the quantities of different 
versions of a gene within a mixture. Different genes in a mixture to be detected and 
quantified may be wild type and mutant genes as may be found in a tumor sample, 
such as a biopsy). In this embodiment, one might design the probes to precisely the same 
site, but one to match the wild-type sequence and one to match the mutant. Quantitative 

25 detection of the products of cleavage firom a reaction performed for a set amount of time 
will reveal the ratio of the two genes in the mixture. Such analysis may also be 
performed on ururelated genes in a mixture. This type of analysis is not intended to be 
limited to two genes. Many variants within a mixture may be similarly measured. 

Alternatively, different sites on a single gene may be monitored and quantified to 

30 verify the measurement of that gene. In this embodiment, the signal fi-om each probe 
would be expected to be the same. 
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It is also contemplated that multiple probes may be used that are not differentiy 
labeled, such that the aggregate signal is measured. This may be desirable when using 
many probes designed to detect a single gene to boost the signal from that gene. This 
configuration may also be used for detecting unrelated sequences within a mix. For 
example, in blood banking it is desirable to know if any one of a host of infectious agents 
is present in a sample of blood. Because the blood is discarded regardless of which agent 
is present, different signals on the probes would not be required in such an application of 
the present invention, and may actually be undesirable for reasons of confidentiality. 

Just as described for the two-oligonucleotide system, above, the specificity of the 
detection reaction will be influenced by the aggregate length of the target nucleic acid 
sequences involved in the hybridization of the complete set of the detection 
oligonucleotides. For example, there may be applications in which it is desirable to 
detect a single region within a complex genome. In such a case the set of 
oligonucleotides may be chosen to require accurate recognition by hybridization of a 
longer segment of a target nucleic acid, often in the range of 20 to 40 nucleotides. In 
other instances it may be desirable to have the set of oligonucleotides interact with 
multiple sites within a target sample. In these cases one approach would be to use a set 
of oligonucleotides that recognize a smaller, and thus statistically more common, 
segment of target nucleic acid sequence. 

In one preferred embodiment, the INVADER and stacker oligonucleotides may be 
designed to be maximally stable, so that they will remain bound to the target sequence for 
extended periods during the reaction. This may be accompUshed through any one of a 
number of measures well known to those skilled in the art, such as adding extra 
hybridizing sequences to the length of the oligonucleotide (up to about 50 nts in total 
length), or by using residues with reduced negative charge, such as phosphorothioates or 
peptide-nucleic acid residues, so that the complementary strands do not repel each other 
to degree that natural strands do. Such modifications may also serve to make these 
flanking oligonucleotides resistant to contaminating nucleases, thus fiwther ensuring their 
continued presence on the target strand during the course of the reaction. In addition, the 
INVADER and stacker oligonucleotides may be covalently attached to the target {e.g., 
through the use of psoralen cross-Iinldng). 
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f) Applications for pooled DNA and RNA samples 
In some embodiments, the present invention provides methods and kits for 
assaying a pooled sample using INVADER detection reagents (e.g. primary probe, 
INVADER probe, and FRET cassette). In some preferred embodiments, the kit 
5 comprises instructions on how to perform the INVADER assay and specifically how to 
apply the INVADER detection assay to pooled samples fi-om many individuals, or to 
"pooled" samples fi-om many cells (e.g. firom a biopsy sample) fi-om a single subject 

In particular embodiments, the present invention allows detection of 
polymorphims in pooled samples combined fi-om many individuals in a population (e.g. 

10 10, 50, 100, or 500 individuals), or from a single subject where the nucleic acid 

sequences are fi-om a large number of cells that are assayed at once. In this regard, the 
present invention allows the frequency of rare mutations in pooled samples to be detected 
and an allele frequency for the population established. In some embodiments, this allele 
frequency may then be used to statistically analyze the results of applying the INVADER 

15 detection assay to an individual's frequency for the polymorphism (e,g. determined xising 
the INVADER assay). In this regard, mutations that rely on a percent of mutants found 
(e.g. loss of heterozygozity mutations) may be analyzed, and the severity of disease or 
progression of a disease determined (See, e.g. US Patent 6,146,828 and 6,203,993 to 
L£q>idus, hereby incorporated by reference for all purposes, where genetic testing and 

20 statistical analysis are employed to find disease causing mutations or identify a patient 
sample as containing a disease causing mutations). 

In some embodiments of the present invention, broad population screens are 
performed. In some preferred embodiments, pooling DNA &om several hundred or a 
thousand individuals is optimal. In such a pool, for example, DNA Scorn any one 

25 individual would not be detectable, and any detectable signal would provide a measure of 
frequency of the detected allele in a broader population. The amount of DNA to be used, 
for example, would be set not by the number of individuals in a pool, but rather by the 
allele frequency to be detected. For example, in some embodiments, an assay gives 
ample signal from 20 to 40 ng of DNA in a 90 minute reaction. At this level of 

30 sensitivity, analysis of 1 \ig of DNA from a high-complexity pool would produce 
comparable signal from alleles present in only about 3-5% of the population. 
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g) Applications of RNA detection. 

RNA quantitation is becoming increasingly important in basic, phaimaceutical, 
and clinical research. For example, quantitation of viral RNAs can predict disease 
progression and therapeutic efficacy. Likewise, gene expression analysis of diseased vs. 
normal, or untreated vs. treated, tissue can identify relevant biological responses or assess 
the effects of pharmacological agents. As the focus of the Human Genome Project 
moves toward gene expression analysis, the field will require a flexible RNA analysis 
technology that can quantitatively monitor multiple forms of alternatively transcribed 

and/or processed RNAs. 

As decribed above for the detection of multiple alleles, multiplex formats of the 
RNA INVADER assay enable simultaneous expression analysis of two or more genes 
within the same sample. In a primary reaction, one-nucleotide overiap-substrates are 
generated by the hybridization of INVADER oligonucleotides and probe oligonucleotides 
to their respective RNA targets. Each probe contains a specific, target-complementary 
region and a distinctive non-complementary 5' flap that is associated only with that 
specific mRNA in that assay. The distinctive flaps may be distinguished in any of the 
myriad ways disclosed herein (e.g., with different labels, different secondary cleavage 
systems having different labels, specific antibodies, different sizes when resolved, 
differenct sequences detected by hybridization in solution or on surfaces, etc.) 

While tiie RNA invasive cleavage assay, like the method used for DNA detection 
described above, can use two invasive cleavage reactions in sequence (described below, 
m Section H of the Detailed Description of the Invention), its preference for the 5' 
nucleases derived from DNA polymerases (described in detail in Section IV of the 
Detailed Description of tiie Invention) indicates that additional fonnat changes are 
preferred. Unlike tiie FEN 5' nucleases generally used for detection of DNA targets, 
optimal signal amplification with tiie DNA Pol-related 5* nucleases occurs only when a 
probe turnover mechanism is employed in both the primary and secondary reactions (in 
contrast to an INVADER ohgonucleotide tiimover mechanism, wherein an INVADER 
oligonucleotide cycles, e.g., to direct the cleavage of multiple FRET cassettes, as 
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described below, in Section n of this Detailed Description). Consequently, in preferred 
embodiments, RNA detection uses sequential operation of the two reactions, rather than 
simultaneous reaction performance. Because the reactions are performed truly 
sequentially, in these embodiments, the RNA INVADER assay signal accumulates 
5 linearly in both a target- and time-dependent maimer. In contrast, the primary and 

secondary reactions of the DNA INVADER assay, when run concurrently, amplify signal 
as a linear function of target level, but as a quadratic function of time. In the sequential 
embodiments, the RNA INVADER assay uses two separate oligonucleotides, a secondary 
probe a FRET probe) and secondary target, for signal generatioiL 

10 A key feature of the RNA invasive cleavage assay is its ability to discriminate 

higjily homologous RNA sequences, such as tiiose found in cytochrome P450 gene 
families. Like the DNA INVADER assay, the RNA INVADER assay can discrimmate 
single-base changes. In some embodiments, the first 5' complementary base of each 
probe is positioned at a non-conserved site in its mRNA target, so that a mismatch 

1 5 prevents fonnation of the overlap-structure, and thus prevents cleavage of the probe. 
Alternatively spliced mRNA variants can be specifically detected by positioning the 
cleavage site at a splice junction. 

To monitor large changes in mRNA levels, the dynamic range of the assay can be 
extended using real-time analysis. However, since the assay generates signal linearly 

20 with time or target level, simply varying the amoimt of sample added per reaction and 
calculating the copies of mRNA per rig total RNA enables accurate quantitation with a 
single endpoint measurement on low-cost instrumentation. Further, in cases where 
absolute quantitation is not necessary, the assay's linear signal amplification mechanism 
and reproducibility also eliminate the need for a standard curve and enable simple and 

25 precise relative quantitation of any one gene. 

The RNA INVADER assay is particularly suited for detecting altematively 
spliced or edited RNA variants because even a single base change at the overlap site 
affects 5' nuclease cleavage. All areas requiring RNA quantitation, such as high- 
throughput screening in drug discovery research, monitoring drug metabolism and safety 

30 in clinical trials, and clinical load monitoring of viral RNA can use this technology. 
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SpUce variants can be monitored in at least two ways with the assay: I) detection of an 
individual exon or 2) detection of a specific splice junction. 

To examine an RNA population for variants having more or fewer exons a&er 
splicing, INVADER assay probe sets are designed for each of the exons of interest (or for 
all exons in the mature RNA). Quantitation of exons, independent of how many mRNAs 
they reside in, may provide information about the number of splice variants for a given 
gene, as weU as indicate the levels of expression for each exon. Mini in vitro transcripts 
containing only one or a few exons can be generated for each probe set so that absolute 
quantitation can be performed for each exon, thus enabling accurate comparisons of exon 
levels. If it is known that a particular exon is present in all known variants, in some 
embodiments, a probe set is designed for that exon for use as an internal control to 
normalize across different samples. RNAs having a one copy of each exon (e.g., 
•'nonnally spliced" KNA) should produce signal from the collection of probe sets in 
certain relative amounts (which should be esseintially equal for all exons, corrected for 
variations in the sensitivity of individual probe sets; see Section V). Alterations in 
splicing alter the relative amounts of the exons. For example, if all of the produced 
RNAs are missing one of the normal exons, the signal for that exon drops toward zero, 
while if half of the RNAs are missing that exon, the signal for that exon drops toward 
50%. More complex combinations of splice variations and mixtures of differently 
spliced mRNAs yield more complex and more informative profiles. Detection is not 
limited to exons. RNA populations may also be monitored for the presence of intron 
sequences that are usually removed by splicing. Such global exon screening provides 
biologically relevant or diagnostic information when comparing normal vs. diseased 
tissue or untreated vs. treated cells (e.g. , in pharmacogenomic screening assays). An 
array-based description of this type of measurement is referred to as alternative splicing 
detector arrays (D. Black, Cell 103:367 [2000]). It is contemplated that the mRNA 
INVADER assay gives similar results but with greater specificity and more accurate 
quantitation than the oligonucleotide array formats. 

In an alternative embodiment, alternatively spliced mRNAs is detected by 
examiniation of specific spUce junctions. The advantage of monitoring the splice sites, as 
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opposed to the exons themselves, is that even splice variants involving very small exons 
{e.g. < 10 nts) are accurately detected with the assay. 

Additionally, in some embodiments, the mRNA INVADER assay isalso used to 
monitor alternative start and stop sites in the mRNA, and is used to monitor lifetimes of 
5 processed and unprocessed RNAs and RNA fragments (e.g., as used in timecourse 
studies following induction). 

II. Signal Enhancement By Incorporating The Products Of An Invasive 
Cleavage Reaction Into A Subsequent Invasive Cleavage Reaction 

10 

As noted above, the oligonucleotide product released by the invasive cleavage can 
be used subsequently in any reaction or read-out method that uses oligonucleotides in the 
size range of a cleavage product. In addition to the reactions involving primer extension 
and transcription, described herein, another enzymatic reaction that makes use of 

15 oligonucleotides is the invasive cleavage reaction. The present invention provide means 
of using the oligonucleotide released in a primary invasive cleavage reaction as a 
component to complete a cleavage structure to enable a secondary invasive cleavage 
reaction. ITis not intended that the sequential use of the invasive cleavage product be 
limited to a single additional step. It is contemplated that many distinct invasive cleavage 

20 reactions may be performed in sequence. 

The polymerase chain reaction uses a DNA replication method to create copies of 
a targeted segment of nucleic acid at a logarithmic rate of accumulation. This is made 
possible by the fact that when the strands of DNA are separated, each individual strand 
contains sufficient information to allow assembly of a new complementary strand. When 

25 the new strands are synthesized the number of identical molecules has doubled. Within 
20 iterations of this process, the original may be copied 1 million-fold, making very rare 
sequences easily detectable. The mathematical power of a doubling reaction has been 
incorporated into a number of amplification assays. 

By performing multiple, sequential invasive cleavage reactions the method of the 

30 present invention captures an exponential mathematical advantage without producing 
additional copies of the target analyte. In a simple invasive cleavage reaction the yield. 
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Y, is simply the turnover rate, K, multipUed by the time of the reaction, t (ie., Y = 
(KXO)- If Y is used to represent the yield of a simple reaction, then the yield of a 
compound (i.e., a multiple, sequential reaction), assuming that each of the individual 
invasive cleavage steps has the same turnover rate, can be simply represented as Y", 
where n is the number of invasive cleavage reactions that have been performed in the 
series. If the yields of each step differ the ultimate yield can be represented as the 
product of the multiplication of the yields of each individual reaction in the series. For 
example, if a primary invasive cleavage reaction can produce one thousand products in 
30 minutes, and each of those products can in tum participate in 1000 additional 
reactions, there will be lOOO' copies (1000 x 1000) of the ultimate product in a second 
reaction. If a third reaction is added to the series, then the theoretical yield will be 1000^ 
(1000 X 1000 X 1000). In the methods of tiie present invention the exponent comes from 
the numbw of invasive cleavage reactions in the cascade. This can be contrasted to the 
amplification methods described above (e.g., PCR) in which Y is limited to 2 by the 
number of strands in duplex DNA, and the exponent n is the number of cycles performed, 
so tiiat many iterations are necessary to accumulate large amounts of product. 

To distinguish tiie exponential amplifications described above &om those of tiie 
presait invention, tiie former can be considered reciprocating reactions because the 
products flie reaction feed back into the same reaction (e.g., event one leads to some 
number of evaits 2, and each event 2 leads back to some number of events 1). In 
contrast, flie evaits in some embodiments of the present invention are sequential (e.g., 
event 1 leads to some number of events 2; each event 2 leads to some number of events 3, 
ete., and no event can contibute to an event earlier in the chain). 

The sensitivity of tiie reciprocating methods is also one of the greatest weaknesses 
whrai these assays are used to determine if a target nucleic acid sequence is present or 
absait in a sample. Because the product of these reactions is detectable copies of tiie 
starting material, contamination of a new reaction witii tiie products of an earlier reaction 
can lead to false positive results, (i.e., the apparent detection of flie target nucleic acid in 
samples tiiat do not actually contain any of tiiat target analyte). Furthermore, because tiie 
concenfa'ation of tiie product in each positive reaction is so high, amounts of DNA 
suflBcient to create a strong false positive signal can be communicated to new reactions 
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very easily either by contact with contaminated instruments or by aerosol. In contrast to 
the reciprocating methods, the most concentrated product of the sequential reaction (z.e., 
the product released in the ultimate invasive cleavage event) is not capable of initiating a 
like reaction or cascade if carried over to a fresh test sample. This is a marked advantage 
5 over the exponential amplification methods described above because the reactions of the 
present invention may be performed without the costly containment arrangements (e.g., 
either by specialized instruments or by separate laboratory space) required by any 
reciprocating reaction. While the products of a penultimate event may be inadvertentiy 
transferred to produce a background of the ultimate product in the absence of the a target 

10 analyte, the contamination would need to be of much greater volume to give an 
equivalent risk of a false positive result 

When the term sequential is used it is not intended to limit the invention to 
configurations in which that one invasive cleavage reaction or assay must be completed 
before the initiation of a subsequent reaction for invasive cleavage of a different probe. 

1 S Rather, the term refers to the order of events as would occur if only single copies of each 
of the oligonucleotide species were used in an assay. The primary invasive cleavage 
reaction refers to that which occurs first, in response to the formation of the cleavage 
structure on the target nucleic acid. Subsequent reactions may be referred to as 
secondary, tertiary and so forth, and may involve artificial "target" strands that serve only 

20 to support assembly of a cleavage structure, and which are unrelated to the nucleic acid 
analyte of interest. While the complete assay may, if desired, be configured with each 
step of invasive cleavage separated either in space (eg., in different reaction vessels) or 
in time (e.g., using a shift in reaction conditions, such as temperature, enzyme identity or 
solution condition, to enable the later cleavage events), it is also contemplated that all of 

25 the reaction components may be mixed so that secondary reactions may be initiated as 
soon as product fix>m a primary cleavage becomes available. In such a format, primary, 
secondary and subsequent cleavage events involving different copies of the cleavage 
structures may take place simultaneously. 

Several levels of this sort of linear amplification can be envisioned, in which each 

30 successive round of cleavage produces an oligonucleotide that can participate in the 
cleavage of a different probe in subsequent rounds. The primary reaction would be 
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specific for the analyte of interest with secondary (and tertiary, etc.) reactions being used 
to generate signal while still being dependent on the primary reaction for initiation. 

The released product may perform in several capacities in the subsequent 
reactions. For example, the product of one invasive cleavage reaction becomes the 
INVADER oligonucleotide to direct the specific cleavage of another probe in a second 
reaction. In such an example, the first invasive cleavage structure is formed by the 
annealing of the INVADER oligonucleotide and the probe oligonucleotide (Probe 1) to 
the first target nucleic acid (Target 1). The target nucleic acid is divided into three 
regions based upon which portions of the INVADER and probe oUgonucleotides are 
capable of hybridizing to the target. Region 1 of the target has complementarity to only 
the INVADER oligonucleotide; region 3 of the target has complementarity to only the 
probe; and region 2 of the target has an overlap between the INVADER and probe 
oligonucleotides. 

Qeavage of Probe I releases the "Cut Probe 1". The released Probe 1 is then 
used as the INVADER oligonucleotide in second cleavage. The second cleavage 
structure is formed by the anneaUng of the Cut Probe 1, a second probe oUgonucleotide 
("Probe 2") and a second target nucleic acid ("Target 2"). In some embodiments, Probe 2 
and the second target nucleic acid are covalently connected, preferably at Iheir 3' and 5' 
ends, respectively, thus forming a hairpin stem and loop, termed herein a "cassette". The 
loop may be nucleic acidor a non-nucleic acid spacer or linker. Inclusion of an excess of 
the cassette molecule allows each Cut Probe I to SCTve as an INVADER to direct the 
cleavage of multiple copies of the cassette. 

Probe 2 may be labeled and detection of cleavage of the second cleavage structure 
may be accomplished by detecting the labeled cut Probe 2; the label may a radioisotope 
(e.g., "p. "S), a fluorophore (e.g., fluorescein), a reactive group capable of detection by a 
secondary agent (e.g., biotin/streptavidin), a positively charged adduct which pemiits 
detection by selective chai^ reversal (as discussed in Section IV above), etc. 
Alternatively, the cut Probe 2 may used in a tailing reaction, or to complete or activate a 
protein-binding site, or may be detected or used by any of the means for detecting or 
using an oligonucleotide described herein. 
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In other embodiments, probe oligonucleotides that are cleaved in the primary 
reaction can be designed to fold back on themselves (/.e., they contain a region of self- 
complementarity) to create a molecule that can serve as both the INVADER and target 
oligonucleotide (termed here an "IT" complex). The IT complex then enables cleavage 
5 of a different probe present in the secondary reaction. Inclusion of an excess of the 
secondary probe molecule ("Probe 2"), allows each IT molecule to serve as the platform 
for the generation of multiple copies of cleaved secondary probe. The target nucleic acid 
is divided into three regions based upon which portions of the INVADER and probe 
oligonucleotides are capable of hybridizing to the target (as discussed above and it is 

10 noted that the target may be divided into four regions if a stacker oUgonucleotide is 
employed). The second cleavage structure is formed by the annealing of the second 
probe ("Probe 2") to the fragment of Probe 1 ("Cut Probe 1") that was released by 
cleavage of the first cleavage structure. The Cut Probe 1 forms a hairpin or stem/loop 
stmcture near its 3' terminus by virtue of the annealing of the regions of self- 

15 complementarity contained within Cut Probe 1 (this self-annealed Cut Probe 1 forms the 
IT complex). The IT confq)lex (Cut Probe 1) is divided into three regions. Region 1 of 
the IT complex has complementarity to the 3' portion of Probe 2; region 2 has 
complementarity to both die 3' end of Cut Probe 1 and to the 5* portion of Probe 2; and 
region 3 contains the region of self-complementarity (/.&., region 3 is complementary to 

20 the 3* portion of the Cut Probe 1). Note that with regard to the IT complex (z.e.. Cut 
Probe 1), region 1 is located upstream of region 2 and region 2 is located upstream of 
region 3. As for other embodiments of invasive cleavage, region "2" can represent a 
region where there is a physical, but not sequence, overlap between the INVADER 
oligonucleotide portion of the Cut Probe 1 and the Probe 2 oligonucleotide. 

25 Tho cleavage products of the secondary invasive cleavage reaction (Le,, Cut Probe 

2) can either be detected, or can in turn be designed to constitute yet another integrated 
INVADER-target complex to be used with a third probe molecule, again unrelated to the 
preceding targets. 

It is envisioned that the oligonucleotide product of a primary cleavage reaction 
30 may fill the role of any of the oligonucleotides described herein {e.g., it may serve as a 
target strand without an attached INVADER oligonucleotide-like sequence, or it may 
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serve as a stacker oligonucleotide, as described above), to enhance the turnover rate seen 
in the secondary reaction by stabilizing the probe hybridization through coaxial stacking. 

Secondary cleavage reactions in some preferred embodiments of the present 
invention include the use of FRET cassettes. Such molecules provide both a secondary 
target and a FRET labeled cleavable sequence, allowing homogeneous detection (i.e., 
without product separation or other manipulation after the reaction) of the sequential 
invasive cleavage reaction. Other preferred embodiments use a secondary reaction 
system in which the FRET probe and synthetic target are provided as separate 
oligonucleotides. 

In a preferred embodiment, each subsequent reaction is facilitated by (i.e., is 
dependent upon) the product of the previous cleavage, so that the presence of the ultimate 
product may serve as an indicator of the presence of the target analyte. However, 
cleavage in the second reaction need not be dependent upon the presence of the product 
of the primary cleavage reaction; the product of the primary cleavage reaction may 
merely measurably enhance the rate of the second cleavage reaction. 

In summary, the INVADER assay cascade (j.e., sequential invasive cleavage 
reactions) of the present invention is a combination of two or more linear assays that 
allows the accumulation of the ultimate product at an exponential rate, but without 
significant risk of carryover contamination. It is important to note that background that 
does not arise from sequential cleavage, such as thermal breakage of the secondary probe, 
generally increases linearly with time. In contrast, signal generation from a 2-step 
sequential reaction follows quadratic kinetics. Thus, collection of data as a time course, 
either by taking time points or through the use of an instrument that allows real-time 
detection during the INVADER assay reaction incubations, provides the attractive 
capabiUty of discriminating between the true signal and any background solely on the 
basis of quadratic versus linear increases in signal over time. For example, when viewed 
graphically, the real signal will appear as a quadratic curve, while any accumulating 
background will be linear, and thus easy to distinguish, even if the absolute level of the 
background signal (e.g., fluorescence in a FRET detection format) is substantial. 

The sequential invasive cleavage amplification of the present invention can be 
used as an intermediate boost to any of the detection methods ie.g., gel based analysis by 
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either standard or by charge reversal), polymerase tailing, and incorporation into a protein 
binding region, described herein. When used is such combinations the increased 
production of a specific cleavage product in the invasive cleavage assay reduces the 
burdens of sensitivity and specificity on the read-out systems, thus facilitating their use. 
5 In addition to enabling a variety of detection platforms, the cascade strategy is 

suitable for multiplex analysis of individual analytes (Le,, individual target nucleic acids) 
in a single reaction. The multiplex format can be categorized into two types. In one case, 
it is desirable to know the identity (and amount) of each of the analytes that can be 
present in a clinical sample, or the identity of each of the analytes as well as an internal 

10 control. To identify the presence of multiple individual analytes in a single sample, 

several distinct secondary amplification systems may be included. Each probe cleaved in 
response to the presence of a particular target sequence (or internal control) can be 
designed to trigger a different cascade coupled to different detectable moieties, such as 
dijaerent sequences to be extended by DNA polymerase or different dyes in an FRET 

15 format The contribution of each specific target sequence to final product can thereby be 
tallied, allowing quantitative detection of different genes or alleles in a sample containing 
a mixture of genes or alleles. 

In the second configuration, it is desirable to determine if any of several analytes 
are present in a sample, but the exact identity of each does not need to be known. For 

20 example, in blood banking it is desirable to know if any one of a host of infectious agents 
is present in a sample of blood. Because the blood is discarded regardless of which agent 
is present, different signals on the probes would not be required in such an application of 
the present invention, and may actually be undesirable for reasons of confidentiality. In 
this case, the S* arms (i.e.y the S' portion that will be released upon cleavage) of the 

25 different analyte-specific probes would be identical and would therefore trigger the same 
secondary signal cascade. A sunilar configuration would permit multiple probes 
complementary to a single gene to be used to boost the signal fi'om that gene or to ensure 
inclusivity when there are numerous alleles of a gene to be detected. 

In the primary INVADER assay reaction, there are two potential sources of 

30 background. The first is &om INVADER oligonucleotide-independent cleavage of probe 
annealed to the target, to itself, or to one of the other oligonucleotides present in the 

112 



wo 01/90337 



PCT/USOl/17086 



reaction. The use of an enzyme that cannot efficiently cleave a structure that lacks a 
primer is preferred for this reason. The enzyme PJu FEN-1 gives no detectable cleavage 
in the absence of the upstream oligonucleotide or even in the presence of an upstream 
oligonucleotide that fails to invade the probe-target complex. This indicates that the Pfu 
FEN-1 endonuclease is a suitable enzyme for use in the methods of the present invention. 

Other structure-specific nucleases may be suitable as a well. As discussed in the 
first example, some 5' nucleases can be used in conditions that significantly reduce this 
primer-independent cleavage. For example, it has been shown that when ttie 5' nuclease 
of DNAPTaq is used to cleave hairpins the primer-independent cleavage is markedly 
reduced by the inclusion of a monovalent salt in the reaction (Lyamichev, et al., [1993], 
supra). 

TSL Effect of ARRESTOR Molecules on Signal and Background in Sequential 
Invasive Cleavage Reactions. 

As described above, the concentration of the probe that is cleaved can be used to 
increase the rate of signal accumulation, with higher concentrations of probe yielding 
higher final signal. However, the presence of large amounts of residual uncleaved probe 
can present problems for subsequent use of the cleaved products for detection or for 
further amplification. If flie subsequent step is a simple detection (e.g., by gel 
resolution), the excess uncut material may cause background by streaking or scattering of 
signal, or by overwhebning a detector {e.g., over-exposing a film in flie case of 
radioactivity, or exceeding the quantitative detection limits of a fluorescence imager). 
This can be overcome by partitioning the product fi-om tiie uncut probe. In more complex 
detection mefliods, the cleaved product may be intended to interact witii another entity to 
indicate cleavage. As noted above, the cleaved product can be used in any reaction that 
makes use of oUgonucleotides, such as hybridization, primer extension, ligation, or the 
direction of invasive cleavage. In each of tiiese cases, tiie fate of tiie residual uncut probe 
should be considered in the design of the reaction. In a primer extension reaction, the 
uncut probe can hybridize to a template for extension. If cleavage is required to reveal 
tiie correct 3' end for extension, tiie hybridized uncut probe will not be extended. It may, 
however, compete witii the cleaved product for tiie template. If flie template is in excess 
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of the combination of cleaved and uncleaved probe, then both of the latter should be able 
to find a copy of template for binding. If, however, the template is limiting, any 
competition may reduce the portion of the cleaved probe that can find successfully bind 
to the available template. If a vast excess of probe was used to drive the initial reaction, 
5 the remainder may also be in vast excess over the cleavage product, and thus may provide 
a veiy effective competitor, thereby reducing the amount of the final reaction 
extension) product for ultimate detection. 

The participation of the uncut probe material in a secondary reaction can also 
contribute to background in these reactions. While the presentation of a cleaved probe 

10 for a subsequent reaction may represent an ideal substrate for the enzyme to be used in 
the next step, some enzymes may also be able to act, albeit inefficiently, on the uncut 
probe as well. It was shown during the development of the present invention that 
transcription can be promoted from a nicked promoter even when one side of the nick has 
additional unpaired nucleotides. Sunilarly, whai the subsequent reaction is to be an 

15 invasive cleavage, the uncleaved probe may bind to the elements intended to form the 
second cleavage structure with the cleaved probe. In experiments conducted during the 
development of the present invention, it was found that some of the 5' nucleases 
described herein can catalyze some measure of cleavage of defective structures. Even at 
a low level, this aberrant cleavage can be misinterpreted as positive target-specific 

20 cleavage signal. 

With these negative eflFects of the surfeit of uncut probe consid^ed, there is 
clearly a need for some method of preventing these interactions. As noted above, it is 
possible to partition the cleaved product from the uncut probe after the primary reaction 
by traditional methods. However, these methods are often time consuming, may be 

25 expensive disposable columns, gels, etc.), and may increase the risk for sample 
mishandling or contamination. It is far preferable to configure the sequential reactions 
such that the original sample need not be removed to a new vessel for subsequent 
reaction. 

The present invention provides a method for reducmg interactions between the 
30 primary probe and other reactants. This method provides a means of specifically 
diverting the uncleaved probes from participation in the subsequent reactions. The 
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diversion is accomplished by the inclusion in the next reaction step an agent designed to 
specifically interact with the uncleaved primary probe. While the primary probe in an 
invasive cleavage reaction is discussed for reasons of convenience, it is contemplated that 
the ARRESTOR molecules may be used at any reaction step within a chain of invasive 
cleavage steps, as needed or desired for the design of an assay. It is not intended that the 
ARRESTOR molecules of the present invention be limited to any particular step. 

The method of diverting the residual uncut probes from a primary reaction makes 
use of agents that can be specifically designed or selected to bind to the uncleaved probe 
molecules with greater affinity than to the cleaved probes, thereby allowing the cleaved 
probe species to effectively compete for the elements of the subsequent reaction, even 
when the uncut probe is present in vast excess. These agents have been termed 
"ARRESTOR molecules," due to their fimction of stopping or arresting the primary 
probe from participation m the later reaction. In various Examples below, an 
oligonucleotide is provided as an ARRESTOR molecule in an invasive cleavage assay, It 
can be appreciated that any molecule or chemical that can discriminate between the fiiU- 
length uncut probe and the cleaved probe, and that can bind or otherwise disable the 
uncleaved probe preferentially may be configured to act as ARRESTOR molecules 
within the meaning of the present invention. For example, antibodies can be derived with 
such specificity, as can the "aptamers" that can be selected through multiple steps of in 
vitro ampUfication (e.g., "SELEX," U.S. Patent Nos. 5,270,163 and 5,567,588; herein 
incorporated by reference) and specific rounds of capture or other selection means. 

In one embodfanent, the ARRESTOR molecule is an oUgonucleotide. In another 
embodiment the ARRESTOR oligonucleotides is a composite oligonucleotide, 
comprising two or more short oligonucleotides that are not covalently linked, but that 
bind cooperatively and are stabilized by co-axial stacking. In a preferred embodiment, 
the oligonucleotide is modified to reduce interactions with the cleavage agents of the 
present invention. When an oligonucleotide is used as an ARRESTOR oligonucleotide, it 
is intended that it not participate in the subsequent reactive step. The binding of the 
ARRESTOR oligonucleotide to the primary probe may, either with the participation of 
the secondary target, or without such participation, create a bifiircated stmcture that is a 
substrate for cleavage by the 5* nucleases used in some embodiments of the methods of 
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the present invention. Formation of such structures would lead to some level of 
unintended cleavage that could contribute to background, reduce specific signal or 
compete for the enzyme. It is preferable to provide ARRESTOR oligonucleotides that 
will not create such cleavage structures. One method of doing this is to add to the 
5 ARRESTOR oligonucleotides such modifications as have been found to reduce the 
activity of INVADER oligonucleotides, as the INVADER oligonucleotides occupy a 
sunilar position within a cleavage structure the 3' end of the INVADER 
oligonucleotide positions the site of cleavage of an unpaired 5' arm). Modification of the 
3' end of the INVADER oligonucleotides was examined for the effects on cleavage in the 

1 0 Example section below; a number of the modifications tested were found to be 
significantly debilitating to the fimction of the. INVADER oligonucleotide. Other 
modifications not described herem may be easily characterized by performing such a test 
using the cleavage enzyme to be used in the reaction for which die ARRESTOR 
oligonucleotide is intended. 

15 In a preferred embodiment, the backbone of an ARRESTOR oligonucleotide is 

modified. This may be done to increase the resistance to degradation by nucleases or 
temperature, or to provide duplex structure that is a less favorable substrate for the 
enzyme to be used {e,g,, A-form duplex vs. B-form duplex). In particularly preferred 
embodiment, the backbone-modified oligonucleotide fiutiier comprises a 3' terminal 

20 modification. In a preferred embodiment, the modifications comprise 2' 0-methyl 

substitution of the nucleic acid backbone, while in a particularly preferred embodiment, 
the 2* 0-methyl modified oligonucleotide fiirther comprises a 3* terminal amine group. 

The purpose of the ARRESTOR oligonucleotide is to allow the minority 
population of cleaved probe to effectively compete with the uncleaved probe for binding 

25 whatever elements are to be used in the next step. While an ARRESTOR oligonucleotide 
that can discriminate between the two probe species absolutely (i.e., binding only to 
uncut and never to cut) may be of the greatest bmefit in some embodiments, it is 
envisioned that in many applications, including the sequential INVADER assays 
described herein, the ARRESTOR oligonucleotide of the present invention may perform 

30 the intended fimction with only partial discrimination. When the ARRESTOR 

oligonucleotide has some interaction with the cleaved probe, it may prevent detection of 
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some portion of these cleavage products, thereby reducing the absolute level of signal 
generated from a given amount of target material. If this same ARRESTOR 
oligonucleotide has the simultaneous effect of reducing the background of the reaction 
{Le,. from non-target specific cleavage) by a factor that is greater than the factor of 

5 reduction in the specific signal, then the significance of the signal {Le„ the ratio of signal 
to background), is increased, even with the lower amount of absolute signal. Any 
potential ARRESTOR molecule design may be tested in a simple fashion by comparing 
the levels of background and specific signals from reactions that lack ARRESTOR 
molecules to the levels of background and specific signal from similar reactions that 

10 include ARRESTOR oligonucleotides. What constitutes an acceptable level of tradeoff 
of absolute signal for specificity will vary for different appUcations (e,g., target levels, 
read-out sensitivity, etc.), and can be determined by any individual user using the 
methods of the present invention. 

15 IV, Improved Enzymes For Use In INVADER Oligonucleotide-Directed 
Cleavage Reactions Comprising RNA Targets; 

A cleavage structure is defined herein as a structure that is formed by the 
interaction of a probe oligonucleotide and a target nucleic acid to form a duplex, the 

20 resulting structure bemg cleavable by a cleavage agent, including but not limited to an 
enzyme. The cleavage structure is fiirther defined as a substrate for specific cleavage by 
the cleavage means in contrast to a nucleic acid molecule that is a substrate for 
nonspecific cleavage by agents such as phosphodiesterases. In considering 
improvements to enzymatic cleavage agents, one may consider the action of said 

25 enzymes on any structures that fall within.the defmition of a cleavage structure. Specific 
cleavage at any site within such a structure is contemplated. 

Improvements in an enzyme may be an increased or decreased rate of cleavage of 
one or more types of structures. Improvements may also result in more or fewer sites of 
cleavage on one or more of said cleavage structures. In developing a library of new 

30 structure-specific nucleases for use in nucleic acid cleavage assays, improvements may 
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have many different embodiments, each related to the specific substrate structure used in 
a particular assay. 

As an example, one embodiment of the INVADER oligonucleotide-directed 
cleavage assay of the present invention may be considered. In the INVADER 
5 oligonucleotide-Kiirected cleavage assay, the accumulation of cleaved material is 
influenced by several features of the enzyme bdiavior. Not surprisingly, the turnover 
rate, or the number of structures that can be cleaved by a single enzyme molecule in a set 
amount of time, is veiy important in detennining the amount of material processed during 
the course of an assay reaction. If an enzyme takes a long time to recognize a substrate 
10 if it is presented with a less-than-optimal structure), or if it takes a long time to . 

, execute cleavage, the rate of product accumulation is lower than if these steps proceeded 
quickly. If these steps are quick, yet the enzyme "holds on" to the cleaved structure, and 
does not unmediately proceed to another uncut structure, the rate will be negatively 
affected. 

1 5 Enzyme turnover is not the only way in which enzyme behavior can negatively 

afifect the rate of accumulation of product. When the means used to visualize or measure 
product is specific for a precisely defined product, products that deviate firom that 
definition may escape detection, and thus the rate of product accumulation may appear to 
be lower than it is. For example, if one had a sensitive detector for trinucleotides that 

20 could not see di- or tetranucleotides, or any sized oligonucleotide other that 3 residues, in 
the INVADER-directed cleavage assay of the present invention any errant cleavage 
would reduce the detectable signal proportionally. It can be seen firom the cleavage data 
presented here that, while there is usually one site within a probe that is favored for 
cleavage, there are often products that arise from cleavage one or more nucleotides away 

25 from the primary cleavage site. These are products that are target-dependent, and are 

thus not non-specific background. Nevertheless, if a subsequent visualization system can 
detect only the primary product, these represent a loss of signal. One example of such a 
selective visualization system is the charge reversal readout presented herein, in which 
the balance of positive and negative charges determines the behavior of the products. In 

30 such a system the presence of an extra nucleotide or the absence of an expected 

nucleotide can exclude a legitimate cleavage product &om ultimate detection by leaving 
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that product with the wrong balance of charge. It can be easily seen that any assay that 
can sensitively distinguish the nucleotide content of an oUgonucleotide, such as standard 
stringent hybridization, suffers in sensitivity when some fraction of the legitimate product 
is not eligible for successful detection by that assay. 

These discussions suggest two highly desirable traits in any enzyme to be used in 
the method of the present invention. First, the more rapidly the enzyme executes an 
entire cleavage reaction, including recognition, cleavage and release, the more signal it 
may potentially created in the INVADER oligonucleotide-directed cleavage assay. 
Second, the more successful an enzyme is at focusing on a single cleavage site within a 
structure, the more of the cleavage product can be successfully detected in a selective 
read-out 

The rationale cited above for making improvements in enzymes to be used in the 
INVADER ohgonucleotide-directed cleavage assay are meant to serve as an example of 
one direction in which improvements might be sought, but not as a limit on either the 
nature or the applications of improved enzyme activities. As another direction of activity 
change that would be ^ropriately considered improvement, the DNAP-associated 5' 
nucleases may be used as an example. In creating some of the polymerase-deficient 5' 
nucleases described herein it was found that the those that were created by deletion of 
substantial portions of the polymerase domain, assumed activities that were weak or 
absent in the parent proteins. These activities included the abiUty to cleave non-forked 
structures, a greatly enhanced ability to exonucleolytically remove nucleotides from the 
5' ends of duplexed strands, and a nascent ability to cleave circular molecules without 
benefit of a free 5' end. 

In addition to the 5' nucleases derived from DNA polymerases, the present 
invention also contemplates the use of stracture-specific nucleases that are not derived 
from DNA polymerases. For example, a class of eukaryotic and archaebacterial 
endonucleases have been identified which have a similar substrate specificity to 5' 
nucleases of Pol I-type DNA polymerases. These are the FENl (Flap EndoNuclease), 
RAD2, and XPG pCeroderma Pigmentosa-complementation group G) proteins. These 
proteins are involved in DNA repair, and have been shown to favor the cleavage of 
structures that resemble a 5' arm that has been displaced by an extending primer during 
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polymerization. Similar DNA repair enzymes have been isolated from single cell and 
higher eukaryotes and from archaea, and there are related DNA repair proteins in 
eubacteria. Similar 5* nucleases have also been associated with bacteriophage such as T5 
andT7. 

5 Recently, the 3-dimensional structures of DNAPTaq and T5 phage 5*-exonuclease 

were determined by X-ray diffraction (Kim et al. Nature 376:612 [1995]; and Ceska et 
al^ Nature 382:90 [1995]). The two enzymes have very similar 3-dimensional structures 
despite limited amino acid sequence similarity. The most striking feature of the T5 
5 -exonuclease structure is the existence of a triangular hole formed by the active site of 

10 the protem and two alpha helices. This same region of DNAPTaq is disordered in the 
crystal structure, indicating that this region is flexible, and thus is not shown in the 
published 3-dimensional structure. However, the 5* nuclease domain of DNAPTaq is 
likely to have the same structure, based its overall 3-dimensional similarity to T5 
5 -exonuclease, and that the amino acids in the disordered region of the DNAPTaq 

15 protein are those associated with alpha helix formation. The existence of such a hole or 
groove in the 5' nuclease domain of DNAPTaq was predicted based on its substrate 
specificity (Lyamichev et aL, supra). 

It has been suggested that the 5' arm of a cleavage structure must thread through 
the helical arch described above to position said structure correctly for cleavage (Ceska et 

20 a/., supra). One of the modifications of 5* nucleases described herein opened up the 
heUcal arch portion of the protein to allow improved cleavage of structures that cut 
poorly or not at all (e.g., structures on circular DNA targets that would preclude such 
threading of a 5' arm). The gene construct that was chosen as a model to test this 
approach was the one called CLEAVASE BN, which was derived from DNAPTaq but 

25 does not contain the polymerase domain. It comprises the entire 5' nuclease domain of 
DNAP Taq, and thus should be very close in structure to the T5 5* exonuclease. This 5' 
nuclease was chosen to demonstrate the principle of such a physical modification on 
proteins of this type. The arch-opening modification of the present invention is not 
intended to be limited to the 5* nuclease domains of DNA polymerases, and is 

30 contemplated for use on any structure-specific nuclease that includes such an aperture as 
a limitation on cleavage activity. The present invention contemplates the insertion of a 
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thrombin cleavage site into the heUcal arch of DNAPs derived from the genus Thermus as 
well as 5- nucleases derived from DNAPs derived from the genus Thermus. The specific 
example shown herein using the CLEAVASE BN/thrombin nuclease merely illustrates 
the concept of opening the heUcal arch located within a nuclease domain. As the amino 
acid sequence of DNAPs derived from the genus Thermus are highly conserved, the 
teachings of the present invention enable the insertion of a thrombin site into the helical 
arch present in these DNAPs and 5' nucleases derived from these DNAPs. 

The opening of the helical arch was accomplished by insertion of a protease site 
in the aich. This allowed post-translational digestion of the expressed protein with the 
appropriate protease to open the arch at its apex. Proteases of this type recognize short 
stretches of specific amino acid sequence. Such proteases include thrombin and factor 
Xa. Cleavage of a protein with such a protease depends on both the presence of that site 
in the amino acid sequence of the protein and the accessibility of that site on the folded 
intact protein. Even with a crystal structure it can be difficult to predict the susceptibility 
of any particular region of a protein to protease cleavage. Absent a crystal structure it 
must be determined empirically. 

In selecting a protease for a site-specific cleavage of a protein that has been 
modified to contain a protease cleavage site, a first step is to test the unmodified protein 
for cleavage at alternative sites. For example, DNAPTaq and CLEAVASE BN nuclease 
were both incubated under protease cleavage conditions with factor Xa and thrombin 
proteases. Both nuclease proteins were cut with factor Xa within the 5' nuclease domain, 
but neither nuclease was digested with large amounts of thrombin. Thus, thrombin was 
chosen for initial tests on opening the arch of the CLEAVASE BN enzyme. 

In the protease/ CLEAVASE modifications described herein the factor Xa 
protease cleaved strongly in an unacceptable position in the unmodified nuclease protein, 
in a region likely to compromise the activity of the end product. Other unmodified 
nucleases contemplated herein may not be sensitive to the factor Xa, but may be sensitive 
to thrombin or other such proteases. Alternatively, they may be sensitive to these or 
other such proteases at sites that are immaterial to the function of the nuclease sought to 
be modified. In approaching any protein for modification by addition of a protease 
cleavage site, the unmodified protein should be tested with the proteases under 
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consideration to determine which proteases give acceptable levels of cleavage in other 
regions. 

Working with the cloned segment of DNAPTaq from which the CLEAVASE BN 
protein is expressed, nucleotides encoding a thrombin cleavage site were introduced 
5 in-frame near the sequence encoding amino acid 90 of the nuclease gene. This position 
was determined to be at or near the apex of the helical arch by reference to both the 
3-dimensional structxn-e of DNAPTaq, and the structure of T5 5' exonuclease. The 
encoded amino acid sequence, LVPRGS, was inserted into the apex of the heUcal arch by 
site-directed mutagenesis of the nuclease gene. The proline (P) in the thrombin cleavage 

10 site was positioned to replace a proline normally in this position in CLEAVASE BN 
because proline is an alpha helix-breaking amino acid, and may be important for the 
3-dimensional structure of this arch. This construct was expressed, purified and then 
digested with thrombin. The digested enzyme was tested for its ability to cleave a target 
nucleic acid, bacteriophage Ml 3 genomic DNA, that does not provide free 5' ends to 

1 5 facilitate cleavage by the threading model 

While the helical arch in this nuclease was opened by protease cleavage, it is 
contemplated that a number of other techniques could be used to achieve the same end. 
For example, the nucleotide sequence could be rearranged such that, upon expression, the 
resulting protein would be configured so that the top of the helical arch (amino acid 90) 

20 would be at the amino terminus of the protein, the natural carboxyl and amino termini of 
the protein sequence would be jomed, and the new carboxyl terminus would lie at natural 
amino acid 89. This approach has the benefit that no foreign sequences are introduced 
and the enzyme is a smgle amino acid chain, and thus may be more stable that the 
cleaved 5' nuclease. In the crystal structure of DNAPTaq, the amino and carboxyl 

25 termini of the 5 -exonuclease domain He in close proximity to each other, which suggests 
that the ends may be directly joined without the use of a flexible linker peptide sequence 
as is sometimes necessary. Such a rearrangement of the gene, with subsequent clonmg 
and expression could be accomplished by standard PGR recombination and cloning 
techniques known to those skilled in the art. 

30 The INVADER invasive cleavage reaction has been shown to be usefiil in the 

detection of RNA target strands {See e.g., U.S. Patent 6,001,567, incorporated herein by 
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reference in its entirety). As with the INVADER assay for the detection of DNA 
(Lyamichev et ai. Nat. Biotechnol., 17:292 [1999]), the reactions may be run under 
conditions tiiat permit the cleavage of many copies of a probe for each copy of the target 
RNA present in the reaction. In one anbodiment, tiie reaction may be performed at a 
temperature close to tiie melting temperature (Tm) of tiie probe that is cleaved, such that 
the cleaved and uncleaved probes readily cycle on and off the target strand without 
temperature cycling. Each time a fiill-lengtii probe binds to the target in the presence of 
the INVADER oligonucleotide, it may be cleaved by a 5' nuclease enzyme, resulting in 
an accumulation of cleavage product. The accumulation is highly specific for the 
sequCTice being detected, and may be configured to be proportional to both time and 
target concentration of the reaction. In another embodiment, the temperature of the 
reaction may be shifted {i.e., it may be raised to a temperatiJie that wUl cause the probe to 
dissociate) then lowered to a temperature at which a new copy of the probe hybridizes to 
the target and is cleaved by the en2yme. In a further embodiment, the process of raising 
and lowering the temperature is repeated many times, or cycled, as it is in PGR (Mullis 
and Faloona, Methods in Enzymology, 155:335 [1987], Saiki et al, Science 230:1350 
[1985]). 

As noted above, 5' nucleases of Pol A ^e DNA polymerases are preferred for 
cleavage of an invasive cleavage structiire that comprises an RNA target stirand. The 
present invention provides eniymes having improved performance in detection assays 
based on the cleavage of a staructiire comprising RNA. In particular, the altered 
polymerases of the present invention exhibit in^roved performance in detection assays 
based on tiie cleavage of a DNA monber of an invasive cleavage structure that comprises 
an RNA target stirand. 

The 5' nucleases of the present invention may be derived from Pol A type DNA 
polymerases. The terminology used in describing the alterations made in tiiis class of 5' 
nucleases relates to the descriptions of DNA polymerase stiiictures known in the art. The 
Klenow fragment of the Pol A polymerase from E. coli (tiie C-terminal two tiiirds, which 
has flie DNA synthesizing activity but lacks the 5' nuclease activity) has been described 
as having a physical form resembling a right hand, having an opeai region called the 
"patai", and a cleft that holds the primer/template duplex defined on one side by a 
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"fingers" domain and on the other by a "thumb" domain (Joyce and Steitz, Trends in 
Biochemical Science 12:288 [1987]). This is shown schematically in Figure 5. Because 
this physical form has proved to be conunon to all Pol A DNA polymerases and to a 
number of additional template-dependent polymerizing enzymes such as reverse 
5 transcriptases, the hand terminology has become known in the art, and the sites of activity 
in these enzymes are often described by reference to their position on the hand. For 
reference, and not intended as a limitation on the present invention, the palm is created 
from roughly the first 200 amino acids of the polymerase domain, the thumb from the 
middle 140, and the fingers by the next 160, with the base of the cleft formed fiom the 

10 remaining regions (Figures 6). Although some enzymes may deviate from these 
structural descriptions, the equivalent domains and sequences corresponding to such 
domains may be identified by sequence homology to known enzyme sequences, by 
comparison of enzyme crystal structures, and other like methods. 

In creating the improved enzymes of the present invention, several approaches 

1 5 have been taken, although the present invention in not limited to any particular approach. 
First two DNA polymerases, Taq and Tth, that have different rates of DNA strand 
cleavage activity on RNA targets were compared. To identify domains related to the 
differences in activity, a series of chimerical constructs was created and the activities 
were measured. This process identified two regions of the Tth polymerase that could, if 

20 transferred into the Taq polymerase, confer on the TaqPol an RNA-dependent cleavage 
activity equivalent to ttiat of the native Tth protein. Once these regions were identified, 
the particular amino acids involved in the activity were examined. Since the two proteins 
are about 87 percent identical in amino acid sequence overall, the identified regions had 
only a small number of amino acid differences. By altering these amino acids singly and 

25 in combinations, a pair of amino acids were identified in TthPol that, if introduced into 
the TaqPol protein, increased the rate of cleavage up to that of the native TthPol. 

These data demonstrate two important aspects of the present invention. First, 
specific amino acids can be changed to confer TthPoHike RNA-dependent cleavage 
activity on a polymerase having a lesser activity. More broadly, however, these results 

30 provide regions of these polymerases that are involved in the recognition of the 

RNA-containing cleavage structure. Identification of these important regions, combined 
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with published information on the relationships of other amino acids to the various 
functions of these DNA polymerases and computer-assisted molecular modeling during 
the development of the present invention have allowed a rational design approach to 
create additional improved 5' nucleases. The information also allowed a focused random 
mutagenesis approach coupled with a rapid screening procedure to quickly create and 
identify enzymes having improved properties. Using these methods of the present 
invention, a wide array of unproved polymerases are provided. 

The methods used in creating and selecting the improved 5' nucleases of the 
present invention are described in detail below and in the experimental examples. A 
general procedure for screening and characterizing the cleavage activity of any 5' 
nuclease is included in the experimental examples. The methods discussions are divided 
into the following sections: I) Creation and selection of chimerical constructs; II) 
Site-specific mutagenesis based on information from chimerical constructs; III) 
Site-specific mutagenesis based on molecular modeling and published physical studies; 
and IV) focused random mutagenesis. 

1) Creation and selection of chimerical constructs 

The PolA-type DNA polymerases, including but not limited to DNA polymerase 
enzymes from Thermus species, comprise two distinctive domains, the 5' nuclease and 
the polymerase domains, shown schematically in Figure 6. The polymerase domains 
reside in the C-termmal two-thirds of the protems and are responsible for both 
DNA-dependent and RNA-dependent DNA polymerase activities. The N-terminal 
one-third portions contain the 5' nuclease domains. In the genus Thermus Pol A 
polymerase, the palm region consists of, roughly, amino acids 300-500, the thumb region 
includes amino acids 500-650, while the fingers region is formed by the remaining amino 
acids from 650 to 830 (Figure 6). 

The derivatives. Taq DN RX HT and Tth DN RX HT, of Taq and TthPol used in 
many of the experiments of the present invention, and described herein, are modified to 
reduce synthetic activity and to facilitate chimera construction, but have 5' nuclease 
activity essentially identical to unmodified TaqPol and TthPol. Unless otherwise 



125 



wo 01/90337 



PCT/USOl/17086 



specified, the TaqPol and TthPol enzymes of the following discussion refer to the DN RX 
HT derivative. 

TthPol has a 4-fold higher cleavage rate with the IL-6 RNA template (shown in 
Figure 10) than TaqPol (shown in Figures 1 1 and 12), although the Taq and TthPols show 
5 similarities of cleavage in DNA target structures (Figure 10). Since the amino acid 
sequences of TaqPol and TthPol (Figures 8 and 9) share about 87% identity and greater 
than 92% similarity, the high degree of homology between the enzymes allowed creation 
of a series of chimeric enzymes between TthPol and TaqPol. The activity of the chimeric 
enzymes was used as a parameter to identify the region(s) of these proteins affecting 

1 0 RNA dependent 5' nuclease activity. 

The chimeric constructs between TthPol and TaqPol genes shown schematically 
in Figures 7 and 19 were created by swapping DNA fragments defined by the restriction 
endonuclease sites, EcoRI and BamHI, common for both genes, the cloning vector site 
Sail and the new sites, NotI, BstBI and Ndel, created at the homologous positions of both 

15 genes by site directed mutagenesis. The restriction enzymes have been abbreviated as 
follows: EcoRI is E; NotI is N; BstBI is Bs; Ndel is D, BamHI is B, and Sail is S. 

The activity of each chimeric enzyme was evaluated using the invasive signal 
amplification assay with the IL-6 RNA target (Figure 10), and the cycling cleavage rates 
shown in Figure 12 were determined as described in the Experimental Examples. 

20 Comparison of the cleavage rates of the first two chimeras, TaqTth(N) and TthTaq(N), 
created by swapping the polymerase and 5' nuclease domains at the NotI site (Figure 7), 
shows that TaqTth(N) has the same activity as TthPol, whereas its counterpart TthTaq(N) 
retains the activity of TaqPol (Figure 12). This result indicates that the higher cleavage 
rate of TthPol is associated with its polymerase domain and suggests an important role of 

25 the polymerase domain in the 5' nuclease activity. 

The next step was to identify a minimal region of TthPol polymerase that would 
give rise to the TthPol-like RNA dependent 5* nuclease activity when substituted for the 
corresponding region of the TaqPol sequence. To this end, the TaqTth(N) chimera was 
selected to generate a series of new constructs by replacing its TthPol sequence with 

30 homologous regions of TaqPol First, the N-tenninal and C-tenninal parts of the TaqPol 
polymerase domain were substituted for the corresponding regions of TaqTth(N) using 
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the common BamHI site as a breaking point to create TaqTth(N-B) and TaqTth(B-S) 
. chimeras, respectively (Figure 7). TaqTth(N-B) which has the TthPol sequence between 
amino acids 328 and 593, is approximately 3 times more active than the TaqTth(B-S) and 
40% more active than TthPol (Figure 12). This result estabUshes that the Notl-BamHI 

5 portion of the TthPol polymerase domain determines superior RNA-dependent 5' 
nuclease activity of TthPol. 

From these data it was determined that a central portion of the TthPol, when used 
to replace the homologous portion of TaqPol (TaqTth(N-B) construct) could confer 
superior RNA recognition on the chimerical protein composed primarily of Taq protem. 

10 In fact, the cycling rate of this chimerical protein exceeded that of the parent TthPol. 
Comparison of chimeras that included sub-portions of the activity-improving region of 
TfljPol, £5)proximately 50% of the region in each case {See, TaqTth(N-D) and 
TaqTth(D-B), Figures 7 and 12) showed no significant improvement in RNA dependent 
activity as compared to the parent TaqPol. This result indicates that aspects of each half 

15 of file region are required for this activity. A construct having an only slightly smaller 
portion of the Tth insert portion (TaqTth(Bs-B)) showed activity that was close to that of 
the parent TthPol protdn, but which was less than that of the TaqTtii(N-B) construct. 

2) Site-specific mutagenesis based on information from chimerical 

20 constructs 

Comparison of flie TthPol and TaqPol amino acid sequences between the BstBI 
and Bamm sites reveals only 25 differences (Figure 13). Among those, there are 12 
conservative changes and 13 substihitions resulting in a change in charge. Since the 
analysis of the chimeric enzymes has suggested that some critical amino acid changes are 

25 located in both BstBI-Ndel and Ndel-BamHI regions of TthPol, site directed 

mutagenesis was used to introduce the TthPol specific amino acids uxto the BstBI-Ndel 
and Ndel-BamHI regions of the TaqTth(D-B) and TaqTth (N-D) chimeras, respectively. 
Six TttiPol-specific substitutions were generated in the BstBI-Ndel region of the 
TaqTth(D.B) by single or double amino acid mutagenesis and only one double mutation, 

30 W417L/G418K, was able to restore the TthPol activity with the IL-6 RNA target {See 
e.g.. Figure 14). Similarly, 12 TtiiPol specific amino acids were introduced at the 
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homologous positions of the Ndel-BamHI region of the TaqTth(N-D) and only one of 
them, E507Q, increased the cleavage rate to the TthPol level {See e.g;. Figure 14). 

To confirm that the W417L, G418K and E507Q substitutions are sufficient to 
increase the TaqPol activity to the TthPol level, TaqPol variants carrying these mutations 
5 were created and their cleavage rates with the IL-6 RNA substrate were compared with 
that of TthPol. Figure 15 shows that the TaqPol W417UG418K/E507Q and TaqPol 
G418K/E507Q mutants have 1.4 times higher activity than TthPol and more than 4 fold 
higher activity than TaqPol, whereas the TaqPol W417L/E507Q mutant has the same 
activity as TthPol, which is about 3 fold higher than TaqPol. These results demonstrate 

10 that K418 and Q507 of TthPol are important amino acids in defining its superior RNA 
dependent 5' nuclease activity compared to TaqPol. 

The ability of these amino acids to improve the RNA dependent S* nuclease 
activity of a DNA polymerase was tested by introducing the corresponding mutations into 
the polymerase A genes of two additional organisms: Thermus filiformvs and Thermus 

15 scotoductus, TaqPol showed improved RNA dependent activity when it was modified to 
contain the W417L and E507Q mutations, which made it more similar at these residues 
to the corresponding residues of TthPol (K418 and Q507). The TfiPol was modified to 
have P420K and E507Q, creating TfiDN 2M, while the TscPol was modified to have 
E416K and E505Q, to create TscDN 2M. The activity of these enzymes for cleaving 

20 various DNA and RNA containing structures was determined as described in Example 1 , 
using the IdT2, IrT3, hairpin and X-structures diagrammed in Figures 21 and 22, with the 
results shown in both Figure 25 and Table 8. Both enzymes have much less 
RNA-dependent cleavage activity than either the TthPol or the Taq 2M enzymes. 
However, introduction of the mutations cited above into these polymerases increased the 

25 RNA dependent cleavage activity over 2 fold compared to flie unmodified enzymes 
(Figure 25). These results demonstrate that transferability of improved RNA dependent 
cleavage activity into a wide range of polymerases using the methods of the present 
invention. 
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3) Site-specific mutagenesis based on molecular modeling and published 
physical studies 

The positions of the G418H and E507Q mutations in the crystal structure of a 
complex of the large fragment of TaqPol (Klentaql) with a primer/template DNA 
determined by Li et al (Li et al. Protein Sci., 7:1116 [1998]) are shown in Figure 17. 
The E507Q mutation is located at the tip of the thumb subdomain at a nearest distance of 
3.8 A and 1 8 A from the backbone phosphates of the primer and template strands, 
respectively. The interaction between the thumb and the minor groove of the DNA 
primer/template was previously suggested by the co-crystal structures of KJenow 
fragment DNA polymerase I (Breese et ai. Science 260:352 [1993]) and TaqPol (Eom et 
at.. Nature 382:278 [1996]). Deletion of a 24 ammo acid portion of the tip of the thumb 
in Klenow fragment, coiresponding to ammo acids 494-518 of TaqPol, reduces the DNA 
binding affmity by more than 100-fold (Minnick et al , J. Biol. Chem., 27 1 :24954 
[1996]). These observations are consistent with the hypothesis that the thumb region, 
which includes the E507 residue, is involved in interactions with the upstream substrate 
duplex. 

The W417L and the G418K mutations in the pabn region of TaqPol (Figure 17) 
are located approximately 25 A from the nearest phosphates of the template and upstream 
strands, according to the co-crystal structures of TaqPol with duplex DNA bound in the 
polymerizing mode (Li et al.. Protein Sci., 7:1116 [1998], Eom et al, Nature 382:278 
[1996]). The same distance was observed between the analogous W513 and P514 amino 
acids of Klenow fragment and the template strand of DNA bound in the editing mode 
(Breese et al. Science 260:352 [1993]). Thus, no interactions between TaqPol and the 
overlapping substrate can be suggested from the available co-crystal studies for this 
region. 

Although an understanding of the mechanism of action of the enzymes is not 
necessary for the practice of the present invention and the present invention is not limited 
to any mechanism of action, it is proposed that the amino acids at positions 417 and 418 
in the palm region of TaqPol mteract with the upstream substrate duplex only when the 
raizyme fimctions as a 5* nuclease, but no interaction with these amino acids occurs when 
TaqPol switches into polymerizing mode. This hypothesis suggests a novel mode of 
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substrate binding by DNA polymerases called here the "5* nuclease mode." Several lines 
of evidence support this hypothesis. The study of the chimeric enzymes described here 
clearly separates regions of the polymerase domain involved in the 5* nuclease and 
polymerase activities. Accordingly, the W417L and G418K mutations, together with the 
5 E507Q mutation, affect the 5* nuclease activity of TaqPol on substrates having an RNA 
target strand (Figure 15), but have no effect on either RNA-dependent or DNA-dependent 
DNA polymerase activities (Figure 1 6). On the other hand, mutations in the active site of 
TaqPol, such as R573A, R587A, E615A, R746A, N750A and D785N, which correspond 
to substitutions in Klenow fragment of Exoli DNA Pol I that affect both polymerase 

10 activity and substrate binding affinity in the polymerizing mode (Polesky ei aL, J. Biol 
Chem., 265:14579 [1990], Polesky et al, J\ BioL Chem., 267:8417 [1992], Pandey ei al, 
Eur, J. Biochem., 214:59 [1993]) were shown to have little or no effect on the 5* nuclease 
activity. Superposition of the polymerase domains of TaqPol (Eom et aiy Nature 
382:278 [1996]), KcoU Pol I and Bacillus stearothermophilus Pol I (Kiefer et al. Nature 

15 391:304 [1998]) using the programs DALI (Hobn and Sander, J. Mol. BioL, 233:123 
[1993], Hotai and Sander, Science 273:595 [1996]) and Insight n Molecular Simulation 
Inc., Naperville, IL) shows that the palm region of TaqPol between amino acids 402-451, 
including W417 and G418, is structurally highly conserved between the three 
polymerases, although there is no structural similarity between the rest of the pahn 

20 subdomains. This observation suggests an important role for this region in eubacterial 
DNA polymerases. 

The 5* nuclease and polymerase activities should be precisely synchronized to 
create a nicked structure rather than a gap or an overhang that could cause a deletion or 
an insertion during Okazaki fragment processing or DNA repair, if ligase joins the ends 

25 inappropriately. According to the previously proposed model (Kaiser et aL, J. Biol. 
Chem., 274:21387 [1999]), the 3* terminal nucleotide of the upstream strand is 
sequestered by the 5* nuclease domain to prevent its extension, thus halting synthesis. 
The interaction with the 3* nucleotide apparently activates the 5' nuclease that 
endonucleolitically removes the displaced 5' arm of the downstream strand. This 

30 cleavage occurs by the precise incision at the site defined by the 3' nucleotide, thus 
creating the nick. This model requires a substantial rearrangement of the 
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substrate-enzyme complex, which may include a translocation of the complex to the 5' 
nuclease mode to separate the primer/template from the polymerase active site. 

It is possible that a relocation of the substrate away from the polymerase active 
site could be induced by the interaction between the duplex formed between the template 
and incoming strands and the crevice formed by the finger and thumb subdomains. Such 
an interaction could force conformational transitions in the thumb that would bring the 
template/primer duplex into close contact with the W417 and G418 amino acids. 
Significant flexibility of the thumb has been previously reported that might explain such 
changes (Beese et al. Science 260:352 [1993], Eom et al. Nature 382:278 [1996], OUis 
et al., Nature 313:762 [1985], Kim et al. Nature 376:612 [1995], Korolev et al., Proc. 
Natl. Acad. Sci.. 92:9264 [1995], Li et al, EMBO J., 17:7514 [1998]). Additional 
conformational changes of the fingers domain that might help to open the crevice, such as 
the transition from die 'closed' to the 'open' structure described by Li et al. (Li et al., 
EMBO J., 17:7514 [1998]), are consistent with this model. It may be that the 5' nuclease 
binding mode was not observed in any of the published co-crystal structures of a DNA 
Pol I because the majority of the structures were solved for the polymerase domain only, 
with a tonplate/primer substrate rather than with an overlapping 5' nuclease substrate. 

Km values of 200-300 nM have been determined for TaqPol, TthPol and TaqPol 
G418K/E507Q for the RNA containing substrate. These values are much higher than the 
Km value of <1 nM estimated for TthPol with an all-DNA overlapping substrate 
suggesting that the RNA template adversely affects substrate binding. The low affinity 
could be explained by the unfavorable interaction between the enzyme and either the 
A-form duplex adopted by the substrate with an RNA target, or the ribose 2' hydroxyls of 
tiie RNA strand. Between these two factors, the latter scans more likely, since the 5' 
nucleases of eubacterial DNA polymerases can efficiently cleave substrates with an RNA 
downstream probe (Lyamichev et al.. Science 260:778 [1993]), which would presumably 
have an A-form. Further, the co-crystal studies suggest that the template/primer duplex 
partially adopts a conformation close to A-form in its complex with DNA polymerase 
(Eom et al.. Nature 382:278 [1996], Kiefer et al.. Nature 391:304 [1998], Li et al., 
EMBO J., 17:7514 [1998]). The G418K/E507Q mutations increase the kcat of TaqPol 
more than two fold, but have little effect on Km- Such an effect would be expected if the 
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mutations position the substrate in an orientation more appropriate for cleavage rather 
than simply increasing the binding constant. 

In addition to the mutational analysis described above, another approach to 
studying specific regions of enzymes, enzyme structure-function relationships, and 
5 enzyme-substrate interaction is to investigate the actual, physical structure of the 
molecule. 

With the advances in crystallographic, NMR, and computer and software 
technology, study of molecular structure has become a viable tool for those interested in 
the configuration, organization, and dynamics of biomolecules. Molecular modeling has 

1 0 increased the understanding of the nature of the interactions that underlie the structure of 
proteins and how proteins interact functionally with substrate. Numerous publications 
describing the structures of various polymerases or polymerase protein portions, HTV 
reverse transcriptase, and other nucleic acid binding proteins have provided mechanistic 
insights into protein conformation, changes in conformation, and molecular interactions 

1 5 necessary for function. 

As an example, the report by Doublie et al (Doublie et ai, Nature 391 :251 
[1998]) discloses the crystal structure of T7 DNA polymerase and provides information 
about which amino acid regions are likely to have an affect on substrate binding, which 
are required to contact the substrate for polymerization, and which amino acids bind 

20 cofactors, such as metal ions. It is noted in this paper and others that many of the 

polymerases share not only sequence similarity, but structural homology as well. When 
certain structural domains of different polymerases are superimposed (for example, T7 
polymerase, Klenow firagment editing complex, the unliganded Taq DNA polymerase 
and the Taq Polymerase-DNA complex) conserved motifs are clearly discemable. 

25 Specifically, combining the infonnation firom all of these different structural 

sources and references, a model of the protein interacting with DNA, RNA, or 
heteroduplex can be made. The model can then be examined to identify amino acids that 
may be involved in substrate recognition or substrate contact. Changes in amino acids 
can be made based on these observations, and the effects on the various activities of the 5* 

30 nuclease proteins are assessed using screening methods such as those of the present 
invention, described in the experimental examples. 
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The domain swapping analysis discussed previously demonstrated that sequences 
of TthDN that are important in RNA-dependent 5' nuclease activity lie in the polymerase 
domain of the protein. Therefore, study of structural data of the polymerase domain with 
respect to nucleic acid recognition provides one method of locating amino acids that, 
5 when altered, alter RNA recognition in a 5 ' nuclease reaction. For example, analysis 
conducted during the development of the present invention examined published analyses 
relating to primer/template binding by the polymerase domain ofE. coli Pol 1, the 
Klenow fragment Table 2 shows a sampling of kinetic constants determined for the 
Klenow fragmmt, and shows the effects a number of mutations on these measurements. 
10 The corresponding or similarly positioned amino acids in the TaqPol are indicated in the 
light hand columa It was postulated that mutations having a noticible impact on the 
intwactions of the Klenow fragment with the DNA template or the primer/template 
duplex, as indicated by changes in Kd and Relative DNA affinity values, might also have 
effects when made at the corresponding sites in TaqPol and related chimerical or mutant 
15 derivatives. A selection of the mutations that produced a higher Kd value or a lower 

Relative DNA affinity value when introduced into the Klenow fragment were created and 
examined in TaqPol. These Taq derivatives include, but are not limited to, those 
indicated by asterisks in the right hand column of Table 2. 

For some Klaiow variants, such as the R682 mutants, selection for testing was 
20 not made based on the DNA affinity measurements, but because molecular modeling 
suggested interaction between some aspect of the template/primer duplex and that amino 
acid. Similarly, additional regions of Taq polymerase (or Taq derivatives) were targeted 
for mutagenesis based on structural data and information from molecular modeling. 
Based on modeling, the thumb region was postulated to contact an RNA template. Thus, 
25 amino acids in the thumb region were looked for that, if altered, might alter that contact. 
For example, Figures 6 and 17 show that amino acids 502, 504, and 507 are located at the 
tip of the thumb. It was postulated that altering these amino acids might have an affect 
on the enzyme-substrate iriteraction. Using the activity screening methods described In 
Example 1, mutations that produced beneficial effects were identified. This approach 
30 was used to create a number of improved enzymes. For example, TaqPol position H784, 
corresponding to Klenow amino acid H881, is an amino acid in the fingers region and, as 
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such, may be involved in primer/template substrate binding. When the H881 amino acid 
in the Klenow enzyme is replaced by alanine, the change decreases the affinity of the 
enzyme for DNA to only 30 to 40% of the wild type level. An analogous substitution 
was tested in a TaqPol-derived enzyme. Starting with the Taq derivative 
5 W417L/G41 8K/E507Q, amino acid 784 was changed from ffistidine (H) to Alanine (A) 
to yield the W417L/G418K/E507Q/H784A mutant, termed Taq 4M. This variant showed 
improved 5' nuclease activity on the RNA test IrTl (Figure 24) test substrate (data in 
Table 3). Amino acid R587 is in the thumb region, and was selected for mutation based 
on its close proximity to the primer/template duplex in computer models. When an 

10 R587A mutation was added to the Taq 4M variant, the activity on the test IrTl test 

substrate was still further improved. In addition, the reduction, relative to the 4M variant, 
in cleavage of the X structure shown in Figure 22 constitutes an additional improvement 
in this enzyme's function. 

Not all amino acid changes that reduce DNA binding in the polymerization aflfect 

15 the 5' nuclease activity. For example, mutations E615A, R677A, affecting amino acid 
that are also in the thumb and fingers domains, respectively, have either adverse effect, or 
no effect on the 5* nuclease activities, respectively, as measured using the test substrates 
in Figures 21 and 22, and compared to the parent variants that lacked these changes. The 
R677A mutation was added to, and compared with the TaqSS variant, while the E615A 

20 mutation was added to and compared with the Taq 4M variant. The test methods 

described herein provide a convenient means of analyzing any variant for the alterations 
in the cleavage activity of both invasive an noninvasive substrates, for both DNA and 
RNA containing structures. Thus, the present invention provides methods for identifying 
all suitable improved enzymes. 

25 Alterations that might mcrease the affinity of tiie enzymes for the nucleic acid 

targets were also examined Many of the mutations described above were selected 
because they caused the Klenow fragment enzyme to have decreased affinity for DNA, 
with the goal of creating enzymes more accepting of structures containing non-DNA 
strands. In general, the native DNA polymerases show a lower affinity for RNA/DNA 

30 duplexes, compared to their aflSnity for DNA/DNA duplexes. During the development of 
the present invention, it was sought to increase the general affinity of the proteins of the 
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presait invention for a nucleic acid substrate without restoring or increasing any 
preference for structures having DNA rather than RNA target strands. The substitution of 
amino acids having different charges was examined as a means of altering the interaction 
between the proteins and the nucleic acid substrates. For example, it was postulated that 

5 addition of positively charged amino acid residues, such as lysine (K), might increase the 
affinity of a protein for a negatively charged nucleic acid. 

As noted above, alterations in the thumb region could affect the interactions of flie 
protein with flie nucleic acid substrate. In one example, the mutation G504K (tip of the 
thumb) was introduced m Taq4M and caused and enhancement of nuclease activity by 

10 1 5% on an RNA target. Additional positively charged mutations (A502K and E507K) 
farther improve the RNA target dependent activity by 50% compared to the parent 
Taq4M eD:^e. 

The use of data fiom published studies and molecular modeling, in combination 
with results accrued during the development of the present invention allowed tiie 

15 identification of regions of the proteins in which changes of amino acids would be likely 
to cause observable differences in at least one aspect of cleavage fanction. While regions 
could be targeted in tins way, it was observed tiiat changes in different amino acids, even 
if near or immediate neighbors in the protein, could have different effects. For example, 
while the A502K substihition created a marked increase in tiie RNA-dependent cleavage 

20 activity of Taq 4M, changing amino acid 499 from G to a K, only 3 amino acids away 
from 502, gave minimal improvement. As can be seen in tiie Experimental Examples, 
tiie approach of tiie present invention was to change several amino acids in a candidate 
region, eitiier alone or in combination, then use tiie screening metiiod provided in 
Example I to rapidly assess the effects of tiie changes. In fliis way, tiie rational design 

25 approach is easily ^plied to tiie task of protein engineering. 

In addition to tiie tiiumb, pahn, and hand regions found in flie polymerase domain 
of tiiese proteins, regions tiiat are specific to 5' nucleases and nuclease domains were 
examined. Comparative stiidies on a variety of 5' nucleases have shown tiiat, tiiough tiie 
amino acid sequences vary dramatically from enzyme to enzyme, there are structural 

30 featiires common to most Two of tiiese features are flie heUx-hairpin-helix motif 
(H-h-H) and flie arch or loop structure. The H-h-H motif is believed to mediate 
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non-sequence specific DNA binding. It has been found in at least 14 fiamilies of proteins, 
including nucleases, N-glycosylases, ligases, helicases, topoisomerases, and polymerases 
Poherty et al, NucL Acid. Res., 24:2488 [1996]). The crystallographic structure of rat 
DNA polymerase pol p bound to a DNA template-primer shows non-specific hydrogen 
5 bonds between the backbone nitrogens of the pol p HhH motif and the phosphate 
oxygens of the primer of the DNA duplex (Pelletier et al. Science 264:1891 [1994]). 
Because the HhH domain of 5* nuclease domains of Taq and Tth polymerases may 
function in a similar manner, it is contemplated that mutations in the HhH region of the 
enzyme alter activity. Mutations may be introduced to alter the shape and structure of the 
1 0 motif, or to change the charge of the motif to cause increased or decreased afOnity for 
substrate. 

Another structure conmion to many 5' nucleases from diverse sources such as 
eukaiyotes, eubacteria, archaea and phage, is the arch or loop domain. The crystal 
structure of the 5' exonuclease of bacteriophage T5 showed a distinct arch formed by two 

15 helices, one positively charged and one containing hydrophobic residues (Ceska et al. 
Nature 382:90 [1996]). Interestingly, three residues that are conserved between T5 and 
Taq, Lys 83, Arg 86 and Tyr 82 are all in the arch. These correspond to amino acids Lys 
83, Arg 86, and Tyr 82 in Taq DNA polymerase. The crystal structure for Taq (5* 
nuclease) has also been determined (Kim et aL, Nature 376:612 [1995]). 

20 The crystal structure from the flap endonuclease-1 from Methanococcus 

janneschii also shows such a loop motif (Hwang et al, Nat. Struct Biol., 5:707 [1998]). 
The backbone crystal structure of Mja FEN-1 molecules may be superimposed on T5 
exonuclease, Taq 5 '-exonuclease and T4 RNase H. An interesting feature common to all 
of these is the long loop. The loop of FEN-1 consists of a number of positively charged 

25 and aromatic residues and forms a hole with dimensions large enough to acconmiodate a 
single-stranded DNA molecule. The corresponding region in T5 exonuclease consists of 
three helices forming a helical arch. The size of the hole formed by the helical arch in T5 
exonuclease is less than half that formed by the LI loop in Mj FEN-1 . In T4 RNfase H or 
Taq 5' exonuclease, this region is disordered. Some regions of the arch bind metals, 

30 while other regions of the arch contact nucleic acid substrate. Alignment of the 

amino-acid sequences of six 5' nuclease domains from DNA polymerases in the pol I 
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family show six highly conserved sequence motife containing ten conserved acidic 
residues (Kim et al.. Nature 376 [1995]). 

The effects of alterations in the arch region were examined. In Taq polymerase 
the arch region is formed by amino acids 80-95 and 96-109. Site directed mutagenesis 
was performed on the arch region. Alignment of amino acid sequences of the FEN and 
polymerase 5' nucleases suggested the design of 3 amino acid substitution mutations, 
P88E, P90E and G80E. These substitutions were made on the Taq4M polymerase mutant 
as a parent enzyme. Results indicated that although the background activity on the HP 
and X substrates shown in Figure 22 are tremendously suppressed in all mutants, the 
desirable 5' nuclease activity on proper substrates (IdT and Irf, Figure 24) is also 
reduced. Despite Gie sequence homology between Taq and Tth polymerases, they have 
very different activity on HP and X substrates. The alignment of the Taq and Tth 
polymerase arch regions also dranonstrates regions of extensive sequence homology as 
well as minor diffCTences. These differences led to the design of mutations L109F and 
AllOT using Taq4M to generate Taq4M L109F/A1 lOT, and the mutant Taq 4M 
A502K/G504K/E507K/T514S to generate Taq 4M 

L109F/A1 10T/A502K/G504K/E507K/T514S mutant. These two mutations have 
drastically converted Taq4M oizyme to become more like Tth enzyme in terms of the 
background substrate specificity while the 5' nuclease activities on both DNA and RNA 
substrates are abnost unchanged. 



4) Focused random mutagenesis 

As described above, physical studies and molecular modeling may be used alone 
or in combination to identify regions of the enzymes in which changes of amino acids are 
likely to cause obsCTvable diflferences in at least one aspect of cleavage function. In the 
section above, use of this information was described to select and change specific amino 
acids or combinations of amino acids. Another method of generating an enzyme with 
altered function is to introduce mutations randomly. Such mutations can be introduced 
by a number of methods known in the art, including but not Umited to, PGR amplification 
under conditions that favor nucleotide misincotporation (REF), amplification using 
primers having regions of degeneracy (i.e., base positions in which different individual, 
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but otherwise similar oligonucleotides in a reaction may have different bases), and 
chemical synthesis. Many methods of random mutagenesis are known in the art (Del Rio 
et aL, Biotechniques 17:1 132 [1994]), and may be incorporated into the production of the 
enzymes of the present invention. The discussions of any particular means of 
5 mutagenesis contained herein are presented solely by way of example and not intended as 
a limitation. When random mutagenesis is performed such that only a particular region 
of an entire protein is varied, it can be described as "focused random mutagenesis." As 
described in the Experimental Examples, a focused random mutagenesis approach was 
applied to vary the HhH and the thumb domains some of the enzyme variants previously 

10 created. These domains were chosen to provide examples of this approach, and it is not 
intended that the random mutagenesis approach be limited to any particular domain, or to 
a single domain. It may be applied to any domain, or to any entire protein. Proteins thus 
modified were tested for cleavage activity in the screening reactions described in 
Example 1, using the test substrates diagrammed in Figures 22 and 24, with the results 

1 5 described in Tables 6 and 7. 

Random mutagenesis was performed on the HhH region with the parent TaqSS or 
TthDN H785A mutants. None of the 8 mutants generated showed an improvement in 
activity compared to the parent enzyme (Table 6). In fact, mutations of the region 
between residues 198-205 have about 2-5 fold lower activity on both DNA and RNA 

20 substrates, suggesting that this region is essential for substrate recognition. Mutagenesis 
in the thumb region resulted in new mutations that improved 5' nuclease activity by 
20-100% on a DNA target and about 10% on an RNA target (Table 7). 

Numerous amino acids in each of the distinct subdomains play roles in substrate 
contact Mutagenesis of these may alter substrate specificity by altering substrate 

25 binding. Moreover, mutations introduced in amino acids that do not directly contact the 
substrate may also alter substrate specificity through longer range or general 
conformation altering effects. These mutations may be introduced by any of several 
methods known in the art, including, but not limited to random mutagenesis, site directed 
mutagenesis, and generation of chimeric proteins. 

30 As noted above, numerous methods of random mutagenesis are known in the art. 

The methods applied in the focused random mutagenesis described herein may be applied 
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to whole genes. It is also contemplated that additional useful chimerical constructs may 
be created through the use of molecular breeding (See e.g., U.S. Pat. No, 5,837,458 and 
PCT PubUcations WO 00/18906, WO 99/65927, WO 98/3 1837, and WO 98/27230, 
herein incorporated by reference in their entireties). Regardless of the mutagenesis 
method chosen, the rapid screening method described herein provides a fast and effective 
means of identifying beneficial changes within a large collection of recombinant 
molecules. This makes the random mutagenesis procedure a manageable and practical 
tool for creating a large collection of altered 5' nucleases having beneficial 
improvements. The cloning and mutagenesis strategies employed for the enzymes used as 
examples are applicable to other thermostable and non-thermostable Type A 
polymerases, since DNA sequence similarity among these enzymes is very high. Those 
skilled in the art would understand that differences in sequence would necessitate 
differences in cloning strategies, for example, the use of different restriction 
endonucleases may be required to generate chimeras. Selection of existing alternative 
sites, or introduction via mutagenesis of altemative sites are well established processes 
and are known to one skilled in the art. 

Enzyme expression and purification can be accomplished by a variety of 
molecular biology methods. The examples described below teach one such method, 
though it is to be understood that the present invention is not to be limited by the method 
of cloning, protein expression, or purification. The present invention contemplates that 
the nucleic acid construct be capable of expression in a suitable host. Numerous methods 
are available for attaching various promoters and 3* sequences to a gene structure to 
achieve efficient expression. 

5) Site-Specific mutagenesis 

In some embodiments of the present invention, any suitable technique {e.g„ 
including, but not limited to, one or more of the techniques described above) are used to 
generate improved cleavage enzymes (e.g., SEQ ID NO:221) with heterologous domains. 
Accordingly, in some embodiments, site-specific mutagenesis (e.g., primer-directed 
mutagenesis using a commercially available kit such as the Transformer Site Directed 
mutagenesis kit (Clontech)) is used to make a pluraUty of changes thoughout a nucleic 
acid sequence in order to generate nucleic acid encoding a cleavage enzyme of the 
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present invention. Insome embodiments, a plurality of primer-directed mutagenesis steps 
are carried out in tandem to produce a nucleic acid encoding a cleavage enzyme of the 
present invention. 

In some embodiments, a plurality of primer directed mutagenesis steps are 
5 directed to a selected portion of a nucleic acid sequence, to produce changes in a selected 
portion of a cleavage enzyme of the present invention. In other embodiments, a nucleic 
acid having changes in one selected portion is recombmed with a nucleic acid having 
mutations in a different selected portion (e.g., through cloning, molecular breeding, or 
any of the other recombination methods known in the art), thereby creating a nucleic acid 

10 having mutations in a plurality of selected portions, and encoding a cleavage enzyme of 
the present invention. The mutations in each selected portion may be introduced by any 
of the methods described above, or any combination of said methods, including but not 
limited to methods of random mutagenesis and site-directed mutagenesis. 

For example, in one illustrative embodiment of the present invention, the nucleic 

15 acid sequence of SEQ ID NO:222 (a nucleic acid sequence encoding the cleavage 
enzyme of SEQ ID NO:221) is generated by making a pluraUty of primer-directed 
mutations to the nucleic acid sequence of SEQ ID NO: 104 (see Example 7 for the 
construction of SEQ ID NO: 104), In some embodiments, each mutation is introduced 
using a separate mutagenesis reaction. Reactions are carried out sequentially such that 

20 the resulting nucleic acid (SEQ ID NO: 222) contains all of the mutations. In another 
illustrative embodiment of the present invention, the nucleic acid sequence of SEQ ID 
NO:222 is generated by making a plurality of primer-directed mutations, as described 
above, in the nuclease portion (e.g., as diagrammed in Figure 6) of SEQ ID NO: 111. The 
mutant nuclease portion is then combined with the "polymerase" portion of SEQ ID 

25 NO: 104 at the Not I site, using the recombination methods described in Example 4, 

thereby creating a single nucleic acid having SEQ ID NO:222, and encoding the cleavage 
enzyme of SEQ ID NO;22 1 . Following mutagenesis, the resulting altered polypeptide is 
produced and tested for cleavage activity using any suitable assay (eg., including, but not 
Umited to, those described in Examples 1 and 6). In some embodiments, the nucleic acid 

30 sequence encoding the cleavage enzyme of SEQ ID NO:221 (e.g., SEQ ID NO:222) is 
further modified using any suitable method. 
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V. Reaction Design for INVADER Assay Detection of RNA Targets; 

Approaches to designing INVADER assays for the detection of KNA targets can 
vary depending on the needs of a particular assay. For example, in some embodiments, 
an RNA to be detected or analyzed may be present in a test sample at low levels, so a 
high level of sensitivity («.c., a low limit of detection, or LOD) may be desirable; in other 
embodiments, an RNA may abundant, and may not require an especiaUy sensitive assay 
for detection. In some embodiments, an RNA to be detected may be similar to other 
RNAs in a sample that are not intended to be detected, so that a high level of selectivity 
in an assay is desirable, while in other embodiments, it may be desued that multiple 
similar RNAs be detected in a single reaction, so an assay may be provided that is not 
selective with respect to the differences among these similar RNAs. 

In some embodiments it is especiaUy desirable to avoid detection of any DNA 
molecules related to the target RNA molecules in a reaction. In some embodiments, this 
is accomplished by designing INVADER assay probe sets to RNA splice junctions, such 
that only the properly spUced mRNAs provide the selected target sites for detection. In 
other embodiments, samples are handled such that DNA remains double stranded (e.g., 
the nucleic acids are not heated or otherwise subjected to denaturing conditions), and is 
. thus not available to serve as target in an INVADER assay reaction. In other 
embodiments, cells are lysed under conditions that leave nuclei intact, thereby containing 
and preventing detection of the genomic DNA, while releasing the cytosoUc mRNAs into 
the lysate solution for detection by the assay. 

In some embodiments, the INVADER assay is to be used for detection or 
quantitation of an entire RNA having a particular variation of a sequence (e.g. , a mutation 
a SNP, a particular spliced junction); in such embodiments, the location of the base or 
sequence to be detected is a detemiining factor in the selection of a site for the 
INVADER assay probe set to hybridize. In other embodiments, any portion of an RNA 
target may be used to indicate the presence or the amount of the entire RNA {e.g., as m 
gene expression analysis). In this case, the probe sets may be directed toward a portion 
of the RNA selected for optimal performance {e.g., sites determined to be particularly 
accessible for probe hybridization) as a target in the INVADER assay. 
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The discussion of INVADER assay probe design is divided into the following sections: 

i. Target site selection based on accessibility 

ii. Target site selection based on selectivity 
5 iii. Oligonucleotide design 

a. Target-specific regions: length and melting temperature 

b. Non-complementary regions 

c. Folding and dimer analysis 
iv. Assay performance evaluation 

10 v. Design and assay optimization 



i. Target site selection based on accessibility 

One consideration in the selection of sites for detection is the availability of the 
target site for hybridization of the assay probe set. To simply use randomly selected 

1 5 complementary oligonucleotides for a given RNA target without prior knowledge of 

regions of the RNA that allow efficient hybridization can be an ineffective approach. For 
example, it is estimated that targeting RNA witii antisense oligonucleotides based on 
random design results in one out of 18-20 tested oligonucleotides showing significant 
inhibition of gene expression (Sczakiel, Fronteirs in Biosciences 5:194 [2000]; Patzel et 

20 a/., Nucleic Acids Res., 27:4328 [1999]; Peyman et al, Biol Chem. Hoppe-Seyler 
367:195 [1995]; Monia et al. Nature Med., 2:668 [1996]). Secondary and tertiary 
structures of RNA are thought to be the major reasons that influence tiie ability of an 
oligonucleotide to bind targeted regions of the RNA (Vickers et al^ Nucleic Acids Res., 
28:1340 [2000]; Lima et al. Biochemistry 31:12055 [1992]; Uhlenbeck, J. MoL Biol,, 

25 65:25 [1972]; Freier and Tinoco, Biochemistry 14:33 10 [1975]). This is due to the 
hybridization kinetics and thermodynamics of destroying any structural motifs of the 
RNA and, in return, hybridizing the complementary DNA oligonucleotide (Patzel et aly 
Nucleic Acids Res., 27:4328 [1999]; Mathews et al, RNA 5:1458 [1999]). Thus, the 
ability to identify regions of RNA that are "accessible" for hybridization is important for 

30 design and selection of effective oligonucleotides. 
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There are several experimental and theoretical methods available for identifying 
accessftle regions in RNA. These include the use of RNase-H footprinting (Ho et al. 
Nature Biotechnology 16:59 [1998]; Mateeva et a/.. Nucleic Acids Res., 25:5010 [1997]; 
Mateeva et al. Nature Biotechnology 16:1374 [1998]). complementary arrays of 
5 oligonucleotide libraries (Southern et al. Nucleic Acids Res., 22:1368 [1994]; Mir and 
Southern, Nature Biotechnology 17:788 [1999]). ribozyme libraries with random 
hexamer internal guide sequences (Campbell and Cech, RNA 1:598 [1995]; Lan et al. 
Science 280:1593 [1998]), and RNA and DNA structure prediction computer programs 
(Sczakiel. Frontiers inBiosciences 5:194 [2000]; Patzel et al. Nucleic Acids Res., 
10 27:4328 [1999]; Zuker, Science 244:48 [1989]; Walton et al, Biotechnol. Bioeng., 65:1 
[1999]). Recently, new methods have been developed that use primer extension to 
identify sites in RNAs that ai« accessible for hybridization. Target nucleic acids {e.g., 
niRNA target nucleic acids) are contacted with a plurality of primers containing a 3' a 
region of degenerate sequence and primer extension reactions are conducted. Where the 
15 target nucleic acid is an RNA molecule, preferred enzymes for use in the extension 
reactions are reverse transcriptases, which produce a DNA copy of the RNA template. 
Folded structures present in the target nucleic acid affect the initiation and/or efficiency 
of the extension reaction. The extension products of the primers are analyzed to provide 
a map of the accessible sites. For example, certain extension products are not generated 
20 where the primer is complementary to a sequence that is involved in a folded stracture. 
Regions of the target nucleic acid that do not allow hybridization of the primer and do not 
result in the production of an extension product are considered inaccessible sites. In 
contrast, the presence of an extension product indicates that the primer was able to bind 
to an accessible region of the target nucleic acid. Such methods are referred to herein as 
25 "revCTse transcription with random oligonucleotide libraries" or "RT-ROL" (HT Allawi, 
et al., RNA 7(2):3 14-27 [2001]). The use of a physical measurement such as RT-ROL or 
array hybridization provides the most direct evidence of the accessibiUty of a site on an 
RNA strand. In general, INVADER assays directed toward accessible regions produce 
stronger signals for a given amount of RNA than assays directed toward less accessible 
30 regions of an RNA strand. For the detection of rare RNAs (e.g., fewer than about 5,000 
to 10,000 copies per INVADER assay reaction), or in any assay wherein it is desirable to 
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have the best (i.e., lowest) limit of detection possible, it may be beneficial to start the 
assay design by analyzing the RNA structure using RT-ROL or another method of 
physical analysis. 

In other embodiments, ease of assay design may be more important than creation 
5 of an assay with a particularly low LOD. Structure prediction software can simplify the 
task of determining which parts of an RNA are likely to be single stranded, and thus be 
more accessible for probe hybridization. As first step, the sequence of an RNA to be 
detected is entered into an electronic file. It may be entered manually or imported from a 
file {e.g,, a sequence data file, or a word processing file). In some embodiments, the 
10 sequence is downloaded firom a database, such as GenBank or EMBL. The RNA 

sequence can then be analyzed using a program such as mfold (Zucker, Science 244:48 
[1989]), OligoWalk (Mathews et al, RNA 5:1458 [1999]), and variations of both 
(Sczakiel, Frontiers in Biosciences 5:194 [2000]; Patzel et aL, Nucleic Acids Res., 
27:4328 [1999]; Walton etaL, Biotechnol. Bioeng., 65;1 [1999]). 

1 5 Mfold Analysis for target RNA structure prediction. 

The output of mfold analysis can be used in several ways to assist in identifying 
accessible regions of an RNA target molecule. In one embodiment, the mfold program is 
used to generate an "ss count" file for identifying regions least likely to be involved in 
mtra-strand baseparing. In another embodiment, the mfold program is used to generate a 
20 "xt" file, a file used as input information for use with RNA Structure 3.5 to perform an 
OligoWalk analysis. In preferred embodiments, for either use, the sequence to be 
detected is entered into mfold In a preferred embodiment, the settings used in the mfold 
analysis include: 

• Folding Temperature: 37'C (Even though the INVADER reaction may not be 
25 conducted at this temperature.) 

• % Suboptimality: 5 

• # foldings: 50 

• Window Parameter: Default 

• Maximum distance between paired bases: No Limit 
30 • Select BATCH folding 
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• Eater:an e mail address where the results are to be seat when ready 

• Image Resolution: High 

• Structure Format: Bases 

• Base Number Frequency:De&ult 
5 • Structure Rotation Angle: 0 

• Structure Annotation: SS-Count 

• 1 M NaCl (Australian mfold Internet site only) 

When results are ready, an e mail message is sent containing the Web address of 
10 the results. The only file that is necessary for subsequent INVADER assay probe design 
analysis is the SS-Count file, which is then downloaded &om the Results page. An 
exemplary mfold analysis using a GenBank entry for Human Ubiquitin (#4506712) is 
shown below: 

SS-Couot analysis for accessible sites identification. 
1 5 The SS-Count file is then imported into an Excel spreadsheet file and the 

following options are chosen: Data Type = Delimited <press Next>; Delimited = Select 
(x) Spaces; (x) Treat multiple delimiters as one <press Next>; Column for data fonnat = 
General. Selecting these options results in the import into Excel of three columns of data 
(Figure 39). The first line in the first column is the total number of stable structures 
20 mfold found under the parameters used m the folding. With the Ubiquitm example, there 
were 12 stmctures found. 

The rest of the first column is the RNA nucleotide position numbered from 1 . 
The second column is the raw SS-Count number and represents the number of times the 
corresponding base was NOT base-paired as part of some secondary structure. The third 
25 column is the sequence of the RNA analyzed, identifying the base at each numbered 
position. By looking for bases that are involved in fewer structures (i.e., bases listed in 
column 3 corresponding to the higher numbers in column 2), it is possible to identify 
regions of the mRNA (identified by position number in column 1) that are more likely to 
be free of intra-strand base-pairing, and thus are more likely to be available for detection 
30 using the INVADER assay. 



145 



wo 01/90337 



PCTAJSOl/17086 



One way of viewing the data is to calculate the running average SS-count for a ten 
nucleotide stretch of the RNA (the Ave(lO) Index) and chart the Ave(10) Index against 
the base pair position (See e.g., Figure 48). 

An alternative plot is to graph the nucleotide position against the Ave(lO) Index 
5 expressed as a percentage of the total number of structures found by mfold (Fig 39). As 
with the raw SS-count table, regions of the RNA corresponding to higher numbers in the 
Ave(10) Index are involved in fewer predicted folded structures. Viewing the Ave(IO) 
Index in either a chart or graph format reduces the complexity of the data, and can reveal 
longer stretches of the RNA that are more likely to be structure-ftee. For example, from 

1 0 the gr^h of the running average, a user can pick out all of the msgor peak areas as likely 
regions for INVADER assay probe design. This creates an SS-Count Candidate List. In 
the Human ubiquitin example, there is one major peak and about 6 oflier peaks (Fig. 39) 
The next step is to refer back to the raw (z.e.» not the rumiing average) SS-Count 
data from within each of the peak areas, and identify the residue whore the running 

IS average is changing in magnitude, this is a local "turn" and is generally a good candidate 
residue to be positioned at the INVADER assay probe set cleavage site. For example, in 
Human Ubiquitin, residue Gl 19 is found at a minor local turn wifliin a globally 
accessible region (Fig, 40). The INVADER assay probe set with the cleavage site at this 
location is a good performer in detection of this RNA. Placing the cleavage site at Gl 14 

20 did not result in better detection even though it had a higher %Ave(10) value. While not 
limiting the present invention to any particular mechanism, and an understanding of the 
mechanism is not necessary to practice the invention, this is likely to be because this area 
was less accessible to the Probe or INVADER oligonucleotides than was position Gl 19. 

OligoWalk Structure prediction with RNA Structure 3.5 for accessible sites 
25 identification 

In some embodiments, the program OligoWalk, a module of the software 
"RNAStructure" (Mathews et al, RNA 5:1458 [1999]) is used in the selection of sites 
that are more likely to be accessible for oligonucleotide binding. OligoWalk uses sets of 
thermodynamic parameters for both RNA and DNA, and their hybrids (AUawi and 
30 SantaLucia, Biochemistry 36:10581 [1997]; Mathews et al, J. Mol. Biol, 288:911 
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[19991; Sugimoto et al, Biochemistry 34: 1 121 1 [1995]) in an algorithm that relies on 
mfold for RNA secondary structure prediction (Zucker, Science 244:48 [1989]). 
OligoWalk is designed to predict the most favorable regions of an RNA target for 
designing antisense oligonucleotides by estimating the overall thermodynamics of 

5 hybridizing an antisense oligomer to the RNA by taking into account the thermodynamics 
of destroying any structural motifs in the RNA target or the antisense oligonucleotide. 
The affinity of the oligomer to its target is expressed as an overall Gibbs free energy 
change of a self-structured oligomer, and of a target associating into an oligomer-target 
complex. This free energy is usually a negative number, indicating favorable binduig, 

10 and is expressed in Tccal/mol' units. OhgoWalk analysis i$ performed with 8 to 15 base 
oligonucleotide size to resemble the average length of the analyte specific region of the 
Signal Probe. Plotting the total binding energy against the length of the RNA generates a 
graph of peaks and valleys. The lowest negative values generally indicate the most 
favorable sites for oligonucleotides to bind. The most inaccessible regions have positive 

15 binding energy values, and generally are a poor sites for assay probe design 

In a preferred embodiment, the OligoWalk module of RNA Structure 3.5 is used 
to determine binding energies by performing an 8-base OUgoWalk using the following 
settings: 

■ Break Local Structure 

20 ■ Include suboptimal structures 

■ OligoL^gth: 8nt 

■ Oligo Concentration: 100 nM 

■ Oligo Type: DNA 

■ Walk entire Target RNA 

25 

When these parameters have been set, the sequence file to be folded (the ".ct" 
output file from mfold) can be selected and opened. Once the sequence has been folded, 
a rq>ort can be created using the Output menu. The report is imported into Excel and the 
data generated above is plotted. In a preferred embodiment, the OligoWalk data is 
30 graphed with the SS-Count data. The regions displaying the lowest free energy values 
(/.&, the largest negative numbers) are generally the most Ukely to be accessible for 
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hybridization. In preferred embodiments, the 3' end and the majority of the target- 
binding region of the probe oligonucleotide complement an accessible region of the target 
RNA. In particularly preferred embodiments, the majority of the binding site for the 
corresponding INVADER oligonucleotide falls within the same accessible region. In 
5 another preferred embodiment, the binding site for an INVADER oligonucleotide falls 
within a nearby accessible region. 

An INVADER oligonucleotide can generally be positioned to bind to a less 
accessible site. While not limiting the present invention to any particular mechanism, it 
is observed that the INVADER oligonucleotides are generally longer than probe 

10 oligonucleotides used in the INVADER assay reactions and, because they are generally 
designed to remain bound to the target at the reaction temperature, they will be selected 
to have a Tn,s about 12 to 15 higher than that of a corresponding probe. Consequently, 
INVADER oligonucleotides may more readily break the local target structure, and thus 
may be less dependent on the accessibility of the target-binding site. 

15 In selecting among accessible sites for the design of INVADER assay 

oligonucleotides, the base composition of the site is also considered. It has been 
observed that stretched of more than 4 or 5 of the same nucleotide in a row (e.g., 
. . . AAAA. . . or . . .CCCC. . .) in any portion of the binding site for the assay 
oligonucleotides may reduce the performance of the probe set in the assay (e.g., by 

20 increasing background or decreasing specificity). Thus, in preferred embodiments, any 
stretches comprising four or more repeated bases are generally avoided. Another 
consideration is the effect of base composition on lengths of the oligonucleotides in the 
probe set. In many cases, targeting A-T rich sequences requires the use of longer 
oligonucleotides for a reaction performed at a given temperature, compared to the length 

25 of oligonucleotides targeted to sequences having a more even distribution of A-T and G- 
C bases. Longer oligonucleotides can be more prone to formation of intrastrand 
structures and dimer structures. Thus, it is preferred that the distribution or A-T bases 
and G-C bases within a target region be as close to even (/.e, about 50% G-C content) as 
the region to be detected permits. In particularly preferred embodiments, the distribution 

30 of A-T and G-C positions is evenly distributed across the binding sites {e.g., not having 
all A-T positions in one half, with all G-C positions in the other). 
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ii. Target site selection based on selectivity 

In some embodiments, probe sets are designed to examine highly homologous, or 
closely related RNA targets (/.&, targets that are very similar in sequence). In such 
5 embodiments, the RNA or homologous cDNA sequences are compared, e.g., using an 
alignment program such as MEGALIGN (DNAstar Madison, WI). 

In some embodiments, selectivity is provided by designing probe sets to detect 
splice junctions. SpUce junctions can be identified by aligning the cDNA and gene 
sequences using an aKgnment program (e.g. MEGALIGN) or under the BLAST menu at 
10 the NCBI website (BLAST 2 sequences). Splice junctions are also often listed in the 
GenBank report (intron/exon sites). INVADER assay oligonucleotide sets are designed 
such that the probe and INVADER oligonucleotides are complementary to the coding 
strand (mRNA), generally with die cleavage site being as close to the splice junction as 
possible. In some embodiments, different splice junctions within an mRNA are analyzed 
15 for accessibility, as described above. In preferred embodiments, probe sets are designed 
to detect one or more spUce junctions showing greater accessibiUty compared to the 
accessibility of other splice junctions within the same RNA target. 

In some embodiments designed to exclude detection of RNAs related to the target 
RNA, sequences are examined to identify bases that are unique to the target RNA when 
20 compared to the other similar sequences from which the target is distinguished 

Generally, the unique base is positioned to hybridize to the 5' end of the target-specific 
region of the probe oligonucleotide. In some embodiments, two adjacent bases are 
unique to the target compared to the related RNA. If two adjacent unique bases are 
available in an appropriately accessible portion of the target RNA, it is preferred that 
25 these bases be used as the site around which the probe and INVADER oligonucleotides 
sets are designed. In some embodiments, the two unique bases are positioned such that 
the site of cleavage of the probe is between the two base-pairs fliey form with the probe. 
In other embodiments, one of the unique bases is in the last position of the hybridization 
site of the INVADER oUgonucleotide (i.e. , it is positioned to base-pair to the penultimate 
30 residue on the 3' end of the INVADER oUgonucleotide). 
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In some embodiments, the assay is designed to include detection of RNAs that are 
similar, but not identical, to the target RNA. If the assay is being designed for inclusive 
det^tion, the compared sequences are examined to identify sites having complete 
homology. Such designs can be created to detect homologous sequences in the same 
5 species or between species. Generally, the most homologous regions are selected as 
hybridization targets for probe oligonucleotides. Genially, some variation can be 
tolerated, for example, if it is not at the base that would hybridize to the 5* end of the 
target-specific region of a probe. In some embodiments, variation is accommodated by 
the use of degenerate bases in the INVADER assay oligonucleotides (eg., mixtures of 
1 0 bases are used at positions within thesynthesized probe, INVADER and/or stacker 
oligonucleotides, said mixtures selected to complement the mixture of specific bases 
present in the collection of related target RNAs). 

iii. Oligonucleotide design 

15 a. Target-specific regions: length and melting temperature 

As described above in Section I (a) concerning the oligonucleotide design, in 
some embodiments, the length of the analyte-specific regions are defined by the 
temperature selected for running the reaction. Starting fix)m the desired position (e.g., a 
variant position or splice junction in a target RNA, or a site corresponding to a low free 

20 energy value in an OligoWalk analysis) an iterative procedure is used by which the length 
of the ASR is increased by one base pair until a calculated optimal reaction temperature 
(Tm plus salt correction to compensate for enzyme and any other reaction conditions 
effects) matching the desired reaction temperature is reached. In general probes are 
selected to have an ASR with a calculated Tm of about 60 °C if a stacking oligonucleotide 

25 is not used, and a Tm of about 50 to 55 if a stacking oligonucleotide is used (a stacking 
oligonucleotide typically raises the Tm of a flanking probe oligonucleotide by about 5 to 
15 °C). If the position of variation or a splice junction is a starting position, then the 
additions are made to the 3* end of the probe. Altematively, if the 3' end of the probe is 
to be positioned at the most accessible site, the additions are in the 5* direction. In some 

30 embodiments, wherein a stacker oligonucleotide is to be used, it is preferred that the 
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probe be designed to have a 3' base that has stable stacking interaction interface with the 
5' base of the stacker oUgonucleotide. The stability of coaxial stacking is highly 
dependent on the identity of the stacking bases. Overall, the stability trend of coaxial 
stacking in decreasing order is purine:purine > purinetpyrimidine « pyriniidne:purine > 
5 pyrimidine:pyriraidine. In other embodiments employing a stacker, a less stable stacking 
interaction is preferred; in such cases the probe 3' base and/or the stacker 5' base are 
selected to provide a leass stable stacking interaction. In some embodiments, the probe 
3' base and/or the stacker 5' base are selected to have a mismatch with respect to the 
target strand, to reduce the strength of the stacking interaction. 
10 The same principles are also followed for I>fVADER oligonucleotide design. 

Briefly, starting from the position N, additional residues complementary to the target 
RNA starting from residue N-1 are then added in the upstream direction until the stabiUty 
of the INVADER-target hybrid exceeds that of the probe (and therefore the planned assay 
reaction temperature). In preferred embodiments, the stability of the INVADER-target 
15 hybrid exceeds that of the probe by 12-15 °C. In general, INVADER oligonucleotides 
are selected to have a Tm near 75 °C. Software applications, such as 
INVADERCREATOR (Third Wave Technologies, Madison. WI) or Oligonucleotide 5.0 
may be used to assist in such calculations. 

If a stacking oligonucleotide is to be used, similar design principles are applied. 
20 The stacking oligonucleotide is generally designed to hybridize at the site adjacent to the 
3' end of the probe oUgonucleotide, such that the stacker/target helix formed can 
coaxiaUy stack with the probe/target helix. The sequence is selected to have a calculated 
Tm of about 60 to 65 "C, with the calculation based on the use of natural bases. 
However, stacking oligonucleotides are generally synthesized using only 2'-0-methyl 
25 nucleotides, and consequently, have actual TmS that are higher than calculated by about 
0,8°C per base, for actual T^s close to 75 °C. 

In some embodiments, ARRESTOR oUgonucleotides are mcluded in a secondary 
reaction. ARRESTOR oligonucleotides are provided in a secondary reaction to sequester 
any remaining uncleaved probe from the primary reaction, to preclude interactions 
30 between the primary probe and the secondary target strand. ARRESTOR 
oligonucleotides are generally 2'-0-methylated, and comprise a portion that is 
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complementary to essentially all of their respective probe's target-specific region, and a 
portion that is complementary to at least a portion of the probe's flap regions (e.g., six 
nucleotides, counted from the +1 base towards the 5* end of the arm). 

b. Non-eomplementary regions 

5 Probe 5' Arm selection 

Hie non-complementary arm of the probe, if present, is preferably selected (by an 
iterative process as described above) to allow die secondary reaction to be performed at a 
particular reaction temperature. In the secondary reaction, the secondary probe is 
generally cycling, and the cleaved 5' arm (serving as an INVADER oligonucleotide) 
1 0 should stably bind to ttie secondary target strand. 



INVADER oligonucleotide 3* terminal mismatch selection 

In preferred embodiments, the 3* base of the INVADER oligonucleotide is not 
complementary to the target strand, and is selected in the following order of preference 
15 (listed as INVADER oligonucleotide 3' base/target base): 
Cintarget: C/C > A/C> T/C> G/C 

A in target: A/A > C/A > G/A > T/ A 

G in target: A/G > G/G > T/G > C/G 

Urn target: C/U> A/U>T/U>G/U 

20 

c. Folding and dimer analysis 

• In some embodiments, the oligonucleotides proposed for use in the INVADER 
assay are examined for possible inter- and intra-molecular structure formation in the 
absence of the target RNA. In general, it is desirable for assay probes to have fewer 
25 predicted inter- or intra molecular interactions. In some embodiments, the program 
OLIGO OLIGO 5.0, Molecular Biology Insights, Inc., Cascade, CO) is used for 
such analysis. In other embodiments, the program mfold is used for the analysis. In yet 
other embodiments, the RNAStructure program can be used for dimer analysis. The 
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following sections provide stepwise instructions for the use of these programs for 
analysis of INVADER assay oligonucleotides. 

OLIGO 5.0 analysis for Probe structure and interaction prediction. 

Analysis of INVADER oligonucleotides using OLIGO 5.0 comprises the 
following steps. AU menu choices are shown in UPPER CASE type. 



1. Launch OLIGO 5.0 and open a sequence file for each mRNA to be analyzed. This 
is done by using a menu to select the following 

10 • Choose FILE- >NEW 

• Paste in longest available sequence 

• Choose ACCEPT & QUIT (F6) 

2. Set Program settings to default 

Choose FILE- >RESET- >ORIGINAL DEFAULTS 

1 5 3. Identify Probe Oligonucleotide 

• Select OLIGO LENGTH to be around 16 nucleotides (open the menu for this 
option by using ctrl-L keystrokes). 

• Move the cursor indicating the 5' end of the Current OUgo untU the 3' end is 
located at the candidate cleavage site residue. 

20 • Choose ANALYSE- >DUPLEX FORMATION- >CURRENT OLIGO (ctrl-D) 

for a rough determination of the extent of dimer and hairpin formation. 

• Confirm length of analyte region corresponds with desired reaction temperature 
[e.g. , through the use of Tm calculation as described in the Optimization of 
Reaction Conditions, I (c) of the Detailed Description of the Invention] 

25 • Select the "LOWER" button in OLIGO 5 .0 to copy the anti-saise sequence (this 
will be the analyte-specific region of the actual probe oligonucleotide and is anti- 
sense to the RNA strand.) 

• Import into a database file. 
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• Save to computer memory. 

4. Identify INVADER Oligonucleotide 

• Choose sequence adjacent to the probe oligonucleotide identified fiom step 3. 

• Select OLIGO LENGTH to -24 nucleotides 

• Confirm length of analyte region corresponds with desired reaction temperature 
[e.g., through the use of Tm calculation as described in the Optimization of 
Reaction Conditions, I (c) of the Detailed Description of the Invention, about 75 
X for INVADER oligonucleotides). Select the "LOWER" button in OLIGO 5.0 
to copy the corresponding anti-sense sequence (this will be the analyte-specific 
region of the actual INVADER oligonucleotide,) 

• Import into a database file. 

• Save to computer memory. 

5. Addition of Cleaved Arm Sequence and INVADER Oligonucleotide Mismatch 
Sequence. 

• Export the Probe oligonucleotide as Upper Primer. 

• Export the INVADER oligonucleotide as Lower Primer. 

• EDIT UPPER PRIMER to add in a candidate arm sequence (selected, for 
example, as described above). 

• Check that the arm sequence does not create new secondary structures (analysis 
performed as described above). 

• EDIT LOWER PRIMER to add in the 3' mismatched nucleotide that will overlap 
into the cleavage site (selected according to the guidelines for this mismatched 
bases, provided above). 

• Select all Upper and Lower Primer boxes in the "Print/Save Options" 

• PRINT ANALYSIS of Upper (Probe) and Lower (INVADER) oligonucleotides 
and check for lack of stable secondary structures. 

• Save both mRNA sequence and oligonucleotide sequence database files before 
quitting the program. 
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Generally, oligonucleotides having detected intra-molecular fonnations with 
stabilities of less than -6 AG are prefened. Less stable stractures represent poor 
substrates for CLBAVASE enzymes, and thus cleavage of such structures is less likely to 
contribute to background signal. Probe and INVADER oligonucleotides having less 
affinity for each other are more available to bind to the target, ensuring the best cycling 
rates. 

The Tm of dimerized probes (i.e.. probes wherein one probe molecule is 
hybridized to another probe molecule) should ideally be lower than the for the probe 
hybridized to the target, to ensure that the probes preferentially hybridize to the target 
sequence at the elevated temperatures at which INVADER assay reactions are generaUy 
conducted. Similarly, the Tm for the INVADER oligonucleotide hybridized either to 
itself or to a probe molecule should be lower than the INVADER oligonucleotide/target 
Tm. It is preferred that dimer T^s (i.c.. Probe/Probe and Probe/INVADER 
oligonucleotide) be 25**C or less to ensure that they wiU be unlikely to form at the 
plaimed reaction temperature. 

The melting tanperatures for each of these complexes can be determined as 
described above in Optimization of Reaction Conditions, I (c) of the Detailed Description 
of the Invention, or by using the OLIGO software. Once RNAs sites and several 
candidate INVADER assay oligonucleotide sets are selected according to the process 
outlined above, the candidate oligonucleotide sets can be ranked according to the degree 
to which they comply willi preferred selection rules, e.g., their location on the SS-Count 
average plot (peak, valley, neiflier), and the energetic predictions of probe and 
INVADER oligonucleotide interactions. In some embodiments, the ranked probe sets are 
tested in order of rank to identify one or more sets having suitable performance in an 
RNA INVADER assay. In o&er embodiments, several of the top ranked sets (e.g., two, 
three or more) are selected for testing, to r^idly identify one or more sets having suitable 
or desireable performance. 

Mfold analysis for probe structure and interaction prediction 

Analysis of probe and INVADER oligonucleotide interactions may be performed 
using mfold for DNA provided by Michael Zuker, available through Rensselaer 
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Polytechnic Institute at bioinfo.math.rpi.edu/~mfolci/dna/fornil.cgi. The analysis is 
performed without changing the default ionic conditions, and with a selected temperature 
of 37 °C and with % suboptimality set to 75. Each sequence probe, INVADER 
oligonucleotide, stacker, etc.) is folded using the program to check for any unimolecular 
5 structure formation hairpins). The energies provided by mfold gives for 
unimolecular structures can be used as provided, without further calculations. 

Bimolecular structiu^ formation for a given oligonucleotide is assessed by typing 
in the oligonucleotide sequence (5* to 3') followed by the sequence of a small, stable 
hairpin forming sequence CCCCCTTTTGGGGG [SEQ ID NO:707]), followed by 
10 the same oligonucleotide sequence, again listed 5' to 3. Constraints are entered to require 
that these Ts remain single-stranded and the strings of Cs and Gs in this spacer are 
basepaired. The conmiand "F" is used to force basepairing, while the conunand "P" is 
used to prohibit basepairing, and the positions of the forced or prohibited basepairs are 
counted from the 5' end. For example, if the sequence of interest is a 20-mer, then the 
1 5 following is entered: 

F 21 0 5 [this forces the C's, C21 to C25, to base pair] 

P 26 0 4 [this forces the T's, T26 to T29, to be single stranded] 

F 30 0 5 [this forces the G's, G30 to G34, to base pair] 

On examination of the resulting structures, the stability of each can be estimated 
20 by subtracting the stability the thermodynamic measures) of the central spacer 

hairpin from the total result Thermodynamics of possible structure = mfold structure 
thermodynamics - core hairpin thermodynamics). For convenience, in some 
embodiments, any nearest neighbor interactions between the central haiipin and dimers 
formed by the test sequence are ignored for this calculation; a more accurate analysis 
25 would require consideration of this interaction. The core hairpin formed by 

CCCCCTTTTGGGGG (SEQ ID NO:707) has the foUowmg thermodynamics: AG = -5.3; 
AH = -37.8; AS = -104.8. 

The process can be demonstrated using the following probe sequence: 5'- 
CCCTATCTTTAAAGTTTTTAAAAAGTTTGA-3' (SEQ ID NO:708). The 
30 oligonucleotide sequence is examined by mfold analysis for bimolecular structures using 
the following steps. 
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1- In mfold sequence box type: 

CCCTATCmAAAGTTTTTAAAAAGT^ 
TATCmAAAGTTmAAAAAGTTTGA (SEQ ID NO:137) 

5 2- In the constraint box type: 
P3604 
F3105 
F4005 



1 0 Results (showing one): 

structure 1 

do = -14.2 dH « -150. S dS o -439.5 Tm » 69..3 

CCCTATCTTT |G G T 

AAA TTTTTAAAAA TTTGA CCCCC T 
15 TTT AAAAATTTTT AAATT GGQGO T 
AG ''G O TCTATCCC T 



To evaluate the stability of the duplex: 

CCCTATCTTT |G G 
20 AAA TTTTTAAAAA TTTGA 
TTT AAAAATTTTT AAATT 
AG "^G G TCTATCCC 

the thermodyanamic values for the hairpin alone are subtracted from the values for the 
25 complete structure: 

AG = -14.2 -(-5.3) = -8.9, 
AH = -150.5-(-37.8) = -112.7, 

AS = -439,5 - (-104.8) = -334.7, 
Usingacalculation wherein Tn, (°C)= {AH/[AS + Rhi(CT/4)]} -273.15, w 

30 the gas constant L987 (cal/K.mol), In is the natural log, and CT is the total single strand 
concentration in Molar, this results in a calculated of 46,1 for the non-hairpin 
portion of the structure. 

The above method is not limited to the use of the core hairpin sequence 
CCCCCTTTTGGGGG but rather any stable hairpin sequences can be used. For 

35 example, CGCGCGGAACGCGCG (SEQ ID NO:138) or CCCGGGTTTTCCCGGG 
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(SEQ ID NO:139). However, if a different hairpin sequence is used, one needs to 
calculate its stability using mfold and use its thermodynamics in the subsequent 
calculations. 

5 RNAStructure for oligonucleotide interaction prediction 

Dimer formation can also be evaluated using the RNAStructure program. Unlike 
mfoldy RNAStructiu'e allows the calculation of all possible oligonucleotide- 
oligonucleotide interactions and provides an output .ct file. One can then view the 
structures using any ,ct viewing program such as RNAStructure or RNAvis (1997, P. 

10 Rijk, University of Antwerp (UIA), available on the Internet at rraa.uia.ac.be/mavis) and 
evaluate the stability of any dimer formation using the nearest-neighbor model (Borer et 
al, 1974) and DNA nearest-neighbor parameters (AUawi & SantaLucia, 1997). 

For example, to evaluate the propensity of the sequence 5' 
AGGCGCACCAATTTGGTGTT y (SEQ ID NO: 140) for duner formation using the 

15 DNA Fold Intermolecular module of RNAStructure, the sequence is saved into a file 
(e.g., probe.seq) and the following parameters are set: 
Sequence file 1: probe.seq 
Sequence file 2: probe.seq 
CT file: dimer.ct 

20 Max % Energy difference: 50 
Max number of structures: 20 
Window size: do not change 

After the calculation is done, one can view the resulting .ct file using the "view" module 
25 of RNAStructure. Generally, there will be several structures within the .ct file. The view 
module is used to view them individually. One of the dimers that the test sequece, above, 
can form according to RNAStructure is: 



30 
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AGGCG TT 
CACCAATTTGGTG 
GTGGTTTAACCAC 
TT GCGGA 

5 

According to the nearest-neighbor model (i.e, using DNA nearest-neighbor and 
mismatch parameters [Allawi & SantaLucia, 1997]), the stability of this duplex in IM 
NaCl and at a probe concentration of 100 jiM is: 
AG°37 = -10,07 
10 AH=-87.6 
AS=-250.1 
Tm= 50.1 "C 



By changmg the identities of Sequence Files 1 & 2, RNAStructure can be used to 
15 evaluate the possibility of any dimer formation between pairs of all of the DNA 
oligonucleotides present in an INVADER assay reaction. 

iv. Assay performance evaluation 

Probe sets selected according to the guideUnes provided above can be tested in the 

20 INVADER assay to evaluate performance. While ttie oUgonucleotides are designed to 
perform at or near a particular desired reaction tanperature, the best performance for a 
given design may not be precisely at the intended temperature. Thus, in evaluating any 
new INVADER assay probe set, it can be helpfiil to examine the performance in the 
INVADER assay conducted at several difiBerent reaction tenq>eratures, over a range of 

25 about 10 to 1 5 "C, centered around the designed temperature. For convenience, 

temperature optimization can be performed on a temperature gradient thermocycler with 
a fixed amount of RNA (e.g., 2.5 amoles of an in vitro transcript per reaction), and for a 
fixed amount of time (e.g., 1 hour each for Primary and Secondary reactions). The 
temperature gradient test will reveal the temperature at which the designed probe set 

30 produces the best performance {e-g. , the highest level of target-specific signal compared 
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to background signal, generally expressed as a multiple of the zero-target background 
signal, or "fold over zero"). 

The results can be examined to see how close the measured temperature optimum 
is to the intended temperature of operation. In some embodiments, it is desirable to have 
5 probe sets tiiat operate at or near a pre-selected temperature. If the measured temperature 
optimum is higher than the desired reaction temperature, a probe design can be altered in 
ways that tend to reduce the probe/target Tm (e.g. , shortened by one or more bases, or 
altered to contain one or more mismatched bases). In some embodiments, wherein a 
stacker oligonucleotide is not used, wherein the reaction temperature is more than 7 

1 0 above the desired reaction temperature, and wherein the performance the fold over 
zero) is acceptable, use of a 3' mismatch on the probe oligonucleotide is likely to lower 
the reaction temperature without otherwise altering the assay performance. 

An LOD determination can be made by performing reactions on varying amounts 
of target RNA an in vitro transcript control RNA of known concentration). In 

1 S preferred embodiments, a designed assay has an LOD of less than 0.05 attomole. hi 
particularly preferred embodiments, a designed assay has an LOD of less than 0.01 
attomole. It is contemplated that the same guideline provided above for reducing the 
LOD of a designed assay may be used for the purpose of raising the LOD of a designed 
assay, /.e, to make it LESS sensitive to the presence of a target RNA, For example, it 

20 may be desirable to detect an abundant RNA and a rare RNA in the same reaction. In 
such a reaction, it may be desirable to attenuate the signal generated for the abundant 
RNA so that it does not overwhelm the signal from the rarer species. In some 
embodiments this may be done by designing probe sets for reduced signal generation, 
e.g., an LOD of at least (not less than) O.S attomoles. In some embodiments, a single step 

25 INVADER assay may be used for detection of abundant targets in a sample, while 

sequential INVADER reactions to amplify signal, as described in Section H, may be used 
for less abundant analytes in the same sample. In preferred embodiments, the single step 
and the sequential INVADER assay reactions for the different analytes are performed in a 
single reaction. 

30 In some embodiments, time course reactions are run, wherein the accumulation of 

signal for a known amount of target is measured for reactions run for different lengths of 
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time. This measurement will establish the linear ranges, z\e, the ranges in which accurate 
quantitative measurements can be made using a given assay design, with respect to time 
and starting target RNA level. 

V. Design and assay optimization 

Some designed assays may not meet the preferred performance criteria described 
above, A number of variations on the performance of INVADER assay reactions have 
been described herein. In optimizing performance of the INVADER assay for the 
detection of RNA targets, these variations may be used alone or in combination. For 
example, in some embodiments, a stacker oligonucleotide is employed. While not 
limitmg the present invention to any particular mechanism of action, in some 
embodunents, a stacker oligonucleotide may enhance performance of an assay by altering 
the hybridization characteristics Tm) of a probe or an INVADER oligonucleotide. 
In some embodiments, a stacker oligonucleotide may increase performance by enabling 
the use of a shorter probe. In other embodiments, a stacker oligonucleotide may enhance 
performance by altering the folded structure of the target nucleic acid. In yet other 
embodiments, the enhancing activity of the stacker oligonucleotide may involve these 
and other mechanisms in combination. 

In other embodiments, the target site may be shifted. In some embodiments, 
reactions are optimized by testing multiple probe sets that shift along a suspected 
accessible site. In preferred embodiments, such probe sets shift along the accessible site 
in one to two base increments. In embodiments wherein accessible sites have previously 
been predicted only by computer analysis, physical detection of the accessible sites may 
be employed to optimize a probe set design. In preferred embodiments, the RT-ROL 
method of detecting accessible sites is employed. In some embodiments, optimization of 
a probe set design may require shifting of the target site to a newly identified accessible 
site. 

In some embodiments, e.g., wherein an accessible site has been identified yet 
probe set performance is low, a change in the design of a probe 5* arm may improve 
assay performance without altering the site targeted. In other embodiments, altering the 
length of an ARRESTOR oligonucleotide (e.g., increasing the length of the portion that is 
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complemOTtary to the 5* ami region of the probe) may reduce background signal, thus 
increasing the probe stet performance. 

Other variations on oligonucleotide design may be employed to alter performance 
in an assay. Some modifications may be employed to shift the ideal operating 
5 temperature of a probe set design into a preferred temperature range. For example, the 
use of shorter oligonucleotides and the incorporation of mismatches generally act to 
reduce the TmS, and thus reduce the ideal operating temperatures, of designed 
oHgonucleotides. Conversely, the use of longer oligonucleotides and the employment of 
stacking oligonucleotides generally act to increase the TmS, and thus increase the ideal 

1 0 operating temperatures of the designed oligonucleotides. 

Other modifications may be employed to alter other aspects of oligonucleotide 
performance in an assay. For example, the use of base analogs or modified bases can 
alter enzyme recognition of the oligonucleotide. In some embodiments, such modified 
bases are used to protect a region of an oligonucleotide fix>m nuclease cleavage. In other 

IS embodiments, modified bases are used to affect the ability of an oligonucleotide to 
participate as a member of a cleavage structure that is not in a position to be cleaved 
to serve as an INVADER oligonucleotide to enable cleavage of a probe). These modified 
bases may be referred to as "blocker" or "blocking" modifications. In some 
embodiments, assay oligonucleotides incorporate 2 -0-methyl modifications. In other 

20 embodiments, assay oligonucleotides incorporate 3' terminal modifications NH2» 3' 
hexanol, 3' phosphate, 3* biotin). 

In yet other embodiments, the components of the reaction may be altered to affect 
assay performance. For example, oligonucleotide concentrations may be varied. 
Oligonucleotide concentrations can affect multiple aspects of the reaction. Since melting 

25 temperatures of complexes are partly a fimction of the concentrations of the components 
of the complex, variation of the concentrations of the oligonucleotide components can be 
used as one facet of reaction optimization. In the methods of the present invention, 
ARRESTOR oligonucleotides may be used to modulate the availability of the primary 
probe oligonucleotides in an INVADER assay reaction. In some embodiments, an 

30 ARRESTOR oligonucleotide may be excluded. Other reaction components may also be 
varied, including enzyme concentration, salt and divalent ion concentration and identity. 
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VI. Kits for performing the RNA INVADER Assay 

In some embodiments, the present invention provides kits comprising one or more 

5 of the components necessary for practicing the present invention. For example, the 
present invention provides kits for storing or delivering the enzymes of the present 
invention and/or the reaction components necessary to practice a cleavage assay (e.g., the 
INVADER assay). The kit may include any and all components necessary or desired for 
the enzymes or assays including, but not limited to, the reagents themselves, buffers, 

1 0 control reagents (e.g., tissue samples, positive and negative control target 

oUgonucleotides, etc.), solid supports, labels, written and/or pictorial instructions and 
product infonnation, inhibitors, labeling and/or detection reagents, package 
environmental controls (e.g., ice, desiccants, etc.), and the like. In some embodiments, 
the kits provide a sub-set of the required components, v/herein it is expected that the user 

1 5 will supply the remaining components. In some embodiments, the kits comprise two or 
more separate containers wherein each container houses a subset of the components to be 
delivered. For example, a first container (e.g., box) may contain an enzyme (e.g., 
structure specific cleavage enzyme in a suitable storage buffer and container), while a 
second box may contain oUgonucleotides (e.g., INVADER oligonucleotides, probe 

20 oligonucleotides, control target oligonucleotides, etc.). In some embodiments one or 
more the reaction components may be provided in a predispoised format (i.e., 
premeasured for use in a step of the procedure without re-measurement or re-dispensing). 
In some embodiments, selected reaction components are mixed and predispensed 
together. In preferred embodiments, predispensed reaction components are predispensed 

25 and are provided in a reaction vessel (including but not limited to a reaction tube or a 
well, as in, a microtiter plate). In particularly preferred embodiments, predispensed 
reaction components are dried down (eg., desiccated or lyophilized) in a reaction vessel. 

Additionally, in some embodiments, the present invention provides methods of 
delivering kits or reagents to customers for use in the methods of the present invention. 

30 Hie methods of the present invention are not limited to a particular group of customers. 
Indeed, the methods of the present invention find use in the providing of kits or reagents 
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to customers in many sectors of the biological and medical community, including, but not 
limited to customers in academic research labs, customers in the biotechnology and 
medical industries, and customers in governmental labs. The methods of the present 
invention provide for all aspects of providing the kits or reagents to the customers, 
5 including, but not limited to, mariceting, sales, delivery, and technical support. 

In some embodiments of the present invention, quality control (QC) and/or 
quality assurance (QA) experiments are conducted prior to delivery of the kits or reagents 
to customers. Such QC and QA techniques typically involve testing the reagents in 
experiments similar to the intended commercial uses using assays similar to those 

1 0 described herein). Testing may include experiments to determine shelf life of products 
and their ability to withstand a wide range of solution and/or reaction conditions 
temperature, pH, light, etc.). 

In some embodiments of the present invention, the compositions and/or methods 
of the present invention are disclosed and/or demonstrated to customers prior to sale {e.g., 

15 through printed or web-based advertising, demonstrations, etc.) indicating the use or 
functionality of the present invention or components of the present invention. However, 
in some embodiments, customers are not informed of the presence or use of one or more 
components in the product being sold. In such embodiments, sales are developed, for 
example, through the improved and/or desired function of the product (e.^., kit) rather 

20 than through knowledge of why or how it works e., the user need not know the 
components of kits or reaction mixtures). Thus, the present invention contemplates 
making kits, reagents, or assays available to users, whether or npt the user has knowledge 
of the components or workings of the system. 

Accordingly, in some embodiments, sales and marketing efforts present 

25 information about the novel and/or improved properties of the methods and compositions 
of the present invention. In other embodiments, such mechanistic information is withheld 
from marketing materials. In some embodiments, customers are surveyed to obtain 
information about the type of assay components or delivery systems that most suits their 
needs. Such information is useful in the design of the components of the kit and the 

30 . design of marketing efforts. 
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Vn. The INVADER Assay for Direct Detection and Measurement of Specific 
RNA Analytes. 

The following section provides a few illustrative examples of mRNAs that may be 
detected or measured using the methods, compositions and systems of the present 
invention. 

HOUSEKEEPING CONTROLS 

KNAs that are generally present in predicable or invariant amounts in test samples 
provide useful control targets for detection assays. These controls can be useful in 
several ways, including but not limited to providing confirmation of the proper function 
of an assay, and as a standard against which a test result for another KNA can be 
compared or measured to aid in interpretation of a result. mRNAs for the following 
genes find particular use in the methods of the present uivention. 
Human Ubiquitin and Mouse/Rat Ubiquitin 

The ubiquitin system is a major pathway for selective protein degradation. 
Degradation by this system is instrumental in a variety of cellular functions such as DNA 
repair, cell cycle progression, signal transduction, transcription, and antigen presentation. 
The ubiquitin pathway also eliminates proteins that are misfolded, misplaced, or that are 
in other ways abnormal. This pathway requires tiie covalent attachment of ubiquitin 
(El), a highly conserved 76 ammo acid protem, to defined lysine residues of substrate 
proteins. 

Human, rat and mouse glyceraldehyde-3-phosphate dehydrogenase (GAPDH) 

GAPDH is an important enzyme in the glycolysis and gluconeogenesis pathways. 
This homotetrameric enzyme catalyzes the oxidative phosphorylation of D- 
glyceraldehyde-3-phosphate to 1,3-diphosphoglycerate in the presence of cofactor and 
inorganic phosphate. A variety of diverse biological properties of GAPDH have been 
reported. These include functions in endocytosis, mRNA regulation, tKNA export, DNA 
replication, DNA repair, and neuronal apoptosis. 
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CYTOKINES 

A growing family of regulatory proteins that deliver signals between cells of the 
immune system has been identified. Called cytokines, these proteins have been found to 
control the growth and development, and bioactivities, of cells of the hematopoietic and 
5 immune systems. Cytokines exhibit a wide range of biological activities with target cells 
from bone marrow, peripheral blood, fetal liver, and other lymphoid or hematopoietic 
organs. The present invention describes methods for the detection of expression of 
cytokines, including but not limited to of the exemplary members of the cytokine family 
listed below. 

10 

Human Oncostatin M 

Qncostatin M is a secreted single-chain polypeptide cytokine that regulates the 
growth of certain tumor-derived and normal cell Hnes. A number of cell types have been 
found to bind the oncostatin M protein. While it has been shown to inhibit proliferation 
15 of a number of tumor cell types, it has also been implicated in stimulating proliferation of 
Kaposi's sarcoma cells. 

Human transforming growth factor-beta (TGF-P) 

Transforming growth fector-beta (TGF-beta) is a member of a family of 

20 structurally-related cytokines that elicit a variety of responses, including growth, 

differentiation, and morphogenesis, in many different cell types. In vertebrates, at least 
five different forms of TGF-beta, termed TGF-betal to TGF-beta5, have been identified; 
they all share a high degree (60%-80%) of amino-acid sequence identity. While TGF- 
betal was initially characterized by its ability to induce anchorage-indq)endent growth of 

25 normal rat kidney cells, its effects on most cell types are anti-mitogenic. It is strongly 
growth-inhibitory for many types of cells, including both normal and transformed 
epithelial, endothehal, fibroblast, neuronal, lymphoid, and hematopoietic cells. In 
addition, TGF-beta plays a central role in regulating the formation of extracellular matrix 
and cell-matrix adhesion processes. 

30 
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Hnman monocyte chemoattractant protein-1 (MCP-1) 

Within this family of cytokines, an emerging group of chemotactic cytokines, also 
called chemokines or intercrines, has been identified. Two subfamilies of chemokines 
have been recognized, alpha and beta, based on chromosomal location and the 
arrangement of the cysteine residues. 

The human gaies encoding the beta subfamily proteins are located on 
chromosome 17 (their mouse counterparts are clustered on mouse chromosome 11, which 
is the counterpart of human chromosome 17). Homology in the beta subfamily ranges 
from 28-45% intraspecies, from 25-55% interspecies. An exemplary member is the 
human protein MCP-1 (monocyte chemoattractant protein-1). MCP-1 exerts several 
eflFects specifically on monocytes. It is a potent chemoattractant for human monocytes in 
vitro and can stimulate an increase in cytosolic free calcium and the respiratory burst in 
monocytes. MCP-1 has been rqwrted to activate monocyte-mediated tumoristatic 
activity, as well as to induce tumoricidal activity. MCP-1 has been impUcated as an 
important fector in mediating monocytic infiltration of tissues inflammatory processes 
such as rheumatoid arthritis and alveoUtis. The factor may also play a fundamental role 
in the recruitment of monocyte-macrophages into developing atherosclCTotic lesions. 

Human tumor necrosis factor alpha (TNF-o) 

Tumor necrosis factor alpha (TNF-alpha also cachectin) is an unportant cytokine 
that plays a role in host defense. The cytokine is produced primarily in macrophages and 
monocytes in response to infection, invasion, injury, or inflammation. Some examples of 
inducers of TNF-alpha mclude bacterial endotoxins, bacteria, viruses, Upopolysaccharide 
(LPS) and cytokines including GM-CSF, IL-1, IL-2 and IFN-gamma. 

Despite the protective effects of the cytokine, overexpression of TNF-alpha often 
results in disease states, particularly in infectious, inflammatory and autoimmune 
diseases. This process may involve the apoptotic pathways. High levels of plasma TNF- 
alpha have been found in mfectious diseases such as sepsis syndrome, bacterial 
meningitis, cerebral malaria, and AIDS; autoimmune diseases such as rheumatoid 
arthritis, inflammatory bowel disease (including Crohn's disease), sarcoidosis, multiple 
sclerosis, Kawasaki syndrome, graft-versus-host disease and transplant (allograft) 



167 



wo 01/90337 



PCT/USOl/17086 



rejection; and organ failure conditions such as adult respiratory distress syndrome, 
congestive heart failure, acute liver failure and myocardial infarction. Other diseases in 
which TNF-alpha is involved include asthma, brain injury following ischemia, non- 
insulin-dependent diabetes raellitus, insulin-dependent diabetes mellitus, hepatitis, atopic 
5 dermatitis, and pancreatitis. Further, inhibitors of TNF-alpha have been suggested to be 
useful for cancer prevention. Elevated TNF-alpha expression may also play a role in 
obesity. TNF-alpha was found to be expressed in human adipocytes and increased 
expression, in general, correlated with obesity. 

10 Human interleukin-6 (IL-6) 

IL-6 is the standardized name of a cytokine called B lymphocyte differentiating 
factor, interferon beta2, 26 Kd protein, hybridoma/plasmacytoma growth factor, 
hepatocyte stimulating factor, etc. 

IL-6 induces activated B cells to be differentiated into antibody forming cells. 

15 For T cells, IL-6 induces T cells stimulated by mitogens to produce IL-2 and induces the 
expression of IL-2 receptor on a certain T cell line or thymocytes. For blood forming 
cells, IL-6 induces the growth of blood forming stem ceUs synergistically in the presence 
of IL-3. Furthermore, recently, it was reported that IL-6 acted like thrombopoietin. 
IL-6 is produced by various cells. It is produced by lymphocytes and is also 

20 produced by human fibroblasts stimulated by Poly (I)-Poly (C) and cycloheximide. 
Murine IL-6 is produced in mouse cells, which are stimulated by Poly (A)-Poly (U). 
Inducers for stimulation are diverse, and include known cytokines such as IL-1, TNF and 
IFN-beta, growth factors such as PDGF and TGF-beta, LPS, PMA, PHA, cholera toxin, 
etc. Moreover, it is reported that human vascular endothelial cells, macrophages, human 

25 glioblastomas, etc. also produce IL-6. Furthemiore, it is also known that the productivity 
can be further enhanced by stimulating cells using an inducer and subsequently treating 
the cells by a metabolic inhibitor such as verapamil, cycloheximide or actinomycin D, 
etc. 
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Humao interleukin lbeta(IL-ip) 

Interleukin-l (IL-1) is important to the activation of T and B lymphocytes and 
mediates many inflammatory processes. Two distinct forms of IL-1 have been isolated 
and expressed; termed IL-lbeta and IL-lalpha. IL-lbeta is the predominant form 
produced by human monocytes both at the mRNA and protein level. The two forms of 
human IL-1 share only 26% amino acid homology. Despite their distinct polypeptide 
sequences, the two forms of IL-1 have stractural similarities, in that the amino acid 
homology is confined to (Kscrete regions of the IL-1 molecule. The two forms of IL-1 
also possess identical biological properties, including induction of fever, slow wave 
sleep, and neutrophilia, T- and B-lymphocyte activation, fibroblast proliferation, 
cytotoxicity for certain cells, induction of coUagenases, synthesis of hepatic acute phase 
proteins, and increased production of colony stimulating factors and collagen. IL-1 also 
activates endottielial cells, resulting in increased leukocyte adhesiv«iess, PGI2 and PGE2 
(prostaglandins) release, and synthesis of platelet activating factor, procoagulant activity, 
and a plasminogen activator inhibitor. Clearly, IL-1 plays a central role in local and 
systemic host responses. Because many of the biological effects of IL-1 are produced at 
picomolar (pg) concentrations in vivo, JL-l production is likely a fimdamental 
characteristic of host defense mechanisms. 



Human interleukin 2 (IL-2) 

Interleukin-2 (JL-2) is the main growth factor of T lymphocytes. By regulating T 
helper lymphocyte activity IL-2 increases the humoral and cellular inamune responses. . 
By stimulating cytotoxic CDS T cells and NK cells, tiiis cytokine participates in tiie 
defense mechanisms against tumors and viral infections. IL-2 is used in therapy against 
metastatic melanoma and renal adenocarcinoma. IL-2 is used in clinical tiials in many 
forms of cancer. It is also used m HIV infected patients and leads to a significant 
increase in CD4 counts. Human IL-2 is a protein of 133 amino acids (aa) composed of 
four alpha helices connected by loops of various length, its tridimensional structure has 
been established. IL-2R is composed of three chains alpha, beta and gamma. IL2Ralpha 
controls the affinity of flie receptor IL-2Rbeta and IL-2Rgamma are responsible for IL-2 
signal transduction. The differrait molecular areas of IL-2 interacting witii tiie fliree 
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chains of the IL-2 R have been defined. More specifically it has been determined that a 
helix A as well as the NH2 terminal area of IL-2 (residues 1 to 30) control the interactions 
IL-2/BL-2Rbeta. 



5 Human interleukin 8 (ILr8) 

Human IL-8 is a cytokine that has variously been called neutrophil-activating 
protein, neutrophil chemotactic factor (NCF) and T-cell chemotactic factor. IL-8 can be 
secreted by several types of cells upon appropriate stimulation. IL-8 is secreted by 
activated monocytes and macrophages as well as by embryonic fibroblasts. 

10 IL-8 is known to induce neutrophil migration and to activate functions of 

neutrophils such as degranulation, release of superoxide anion and adhesion to the 
endothelial cell monolayer. There are a number of conditions that are known to involve 
leukocyte infiltration into lesions. These include pulmonary diseases such as puhnonary 
cystic fibrosis, idiopathic pulmonary fibrosis, adult respiratory distress syndrome, 

IS sarcoidosis and empyema; dermal diseases such as psoriasis, rheumatoid arthritis; and 
inflammatory bowel disease (Crohn's Disease). 

The amino acid sequence characterizing human IL-8 was described by 
Matsushima, et al. in PCT application WO89/08665. More recently, it was shown that 
monocyte-derived IL-8 was evidently variably processed at the N-terminus and that the 

20 IL-8 originally disclosed by Matsushima et al. was accompanied by two forms of the 
factor which had seven or five additional amino acids at the N-terminus (Y oshimura, et 
al, Mol Immunol 26:87 [1989]). The longest form accounted for about 8%, the next 
longest form for about 47%, and the shortest form for about 45% of the total IL-8 derived 
firom monocytes. 

25 

Human interleukin 10 (IL-10) 

Interleukin-lO (IL-10), a recently discovered lymphokine, was originally 
described as an inhibitor of interferon-gamma synthesis and is postulated as a major 
mediator of the humoral class of immune response. Two classes of often mutually 
30 exclusive immune responses are the humoral (antibody-mediated) and the delayed-type 
hypersensitivity. 
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It is postulated that these two differing immune responses may arise from two 
types of helper T-cell clones, namely Thl and Th2 helper T-cells. which demonstrate 
distinct cytokine secretion patterns. Mouse Thl cell clones secrete interferon-gamma, 
and and preferentially induce the delayed-type hypersensitivity response while Th-2 
cell clones secrete IL-4, IL-5 and E.-10 and provide support for the humoral responses. 
The contrast in immune response could result because interferon-gamma secreted by the 
Thl cell clones inhibits Th2 clone proliferation in vitro, while EL-IO secreted by the Th2 
ceU clones inhibits cytokine secretion by the Thl ceU clones. Thus the two T-helper cell 
types may be mutually inhibitory and may provide the underpinning for the two 
dissimilar immune responses. 

IL-10 has been cloned and sequenced from both murine and human T cells. Both 
sequences contain an open reading frame encoding a polypeptide of 178 amino acids with 
an N-terminal hy&ophobic leader sequence of 18 amino acids, and have an amino acid 
sequence homology of 73%. 



Human interleukin 4 (IL-4) 

Interleukin-4 (IL-4, also known as B cell stimulating factor, or BSF-1) was 
originally characterized by its ability to stimulate the proliferation of B cells in response 
to low concentrations of antibodies directed to surface immunoglobulin. More recently, 
IL-4 has been shown to possess a far broader spectrum of biological activities, including 
growth co-stimulation of T cells, mast cells, granulocytes, megakaryocytes, and 
erythrocytes. In addition, IL-4 stimulates the proliferation of several IL-2- and IL-3- 
dependent cell lines, induces the expression of class H major histocompatibility complex 
molecules on resting B cells, and enhances the secretion of IgE and IgGl isotypes by 
stimulated B cells. Both murine and human IL-4 have been definitively characterized by 
recombinant DNA technology and by purification to homogeneity of the natural murine 
protein. 

The biological activities of IL-4 are mediated by specific cell surface receptors for 
IL-4 that are expressed on primary cells and in vitro cell lines of mammalian origin. IL-4 
binds to the receptor, which then transduces a biological signal to various immune 
effector cells. 
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Human interferon gamma (IFN-y) 

Like the interleukins, interferons belong to the class of the cytokines and are listed 
in various classes: interferon-alpha, interferon-beta, interferon-gamma, interferon-omega 
5 and interferon-tau. Interferon-gamma is a glycoprotein, the amino acid sequence of 
which has been known since 1982. In the mature condition the interferon-gamma has 
143 amino acids and a molecular weight of 63 to 73 kilodaltons. 

The tertiary and quaternary structure of the non-glycosylised protein was clarilSed 
in 1991. According to this, interferon-gamma exists as a homodimer, the monomers 

10 being orientated in contrary directions in such a way that the C-terminal end of one 

monomer is located in the vicinity of the N-terminal end of the other monomer. Each of 
these monomers in all has six alpha-helices. Interferon-gamma is also called 
immunointerferon, as it has non-specific antiviral, antiprolifmtive and in particular 
immunomodulatory effects. Its production in T-helper-lymphocytes is stimulated by 

15 mitogens and antigens. The effect of the expressed interferon-gamma has not yet been 
precisely clarified, but is subject to intensive research. In particular, interferon-gamma 
leads to the activation of macrophages and to the synthesis of histocompatability antigens 
of the class 2. In vitro, the activity of interferon-gamma is normally determined as a 
reduction in the virus-induced cytopathic effect, which arises firom treatment with 

20 interferon-gamma. Due to its antigen-non-specific antiviral, antiproliferative and 

immunomodulatory activity it is suitable as a human therapeutic agent, for example of 
kidney tumours and chronic granulomatosis. 

CYTOCHROME P450s 

25 The term cytochrome P-450 refers to a family of enzymes (located on the 

endoplasmic reticulum, with high concentrations of the proteins in the cells of the liver 
and small intestine) that are primarily responsible for the metabolism of xenobiotics such 
as drugs, carcinogens and environmental chemicals, as well as several classes of 
endobiotics such as steroids and prostaglandins. Members of the cytochrome P450 

30 family are present in varying levels and their expression and activities are controlled by 
variables such as chemical enviroimient, sex, developmental stage, nutrition and age. 
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More than 200 cytochrome P450 genes have been identified. There are multiple 
forms of these P450 genes and each of the individual forms exhibit degrees of specificity 
towards individual chemicals in the above classes of compounds. In some cases, a 
substrate, whether it be drug or carcinogen, is metabolized by more then one of the 
cytochromes P450. Genetic polymorphisms of cytochromes P450 result in 
phenotypically-distinct subpopulations that differ in their abiUty to perform 
biotransformations of particular drugs and other chemical compounds. 

The present invention provides methods for the detection cytochrome P450 
raRNAs, including but not limited to, Human CYP lAl, Human CYP 1 A2, Human CYP 
2B1. Human CYP 2B2, Human CYP 2B6, Human CYP 2C19, Human CYP 2C9, Human 
CYP 2D6, Human CYP 3A4, Human CYP 3A5, Human CYP 3A7, Rat CYP 2E1, Rat 
CYP 3AI, Rat CYP 3A2, Rat CYP 4A1, Rat CYP 4A2 and Rat CYP 4A3. 
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EXAMPLES 

The following examples serve to illustrate certain preferred embodiments and 
aspects of the present invention and are not to be construed as lindting the scope thereof 

In the disclosure which follows, the following abbreviations apply: AfU 
(Archaeoglobiis fiilgidus); Mth {Methanobacterium thermoautotrophicum); Mja 
(Metfianococcus jannaschii); Pfu [Pyrococcus furiostis); Pwo (Pyrococcus woesei)\ Taq 
{Thermus aquaticus)\ Taq DNAP, DNAPra^, and Taq Pol I {T aquaticus DNA 
polymerase I); DNAPStf (the Stofifel fragment of DNAPra?); DNAPEcl {E. coli DNA 
polymerase I); Tth {Thermus thermophilus)\ Ex. (Example); Fig. (Figure);**C (degrees 
Centigrade); g (gravitational field); hr (hour); min (minute); olio (oligonucleotide); rxn 
(reaction); vol (volume); w/v (weight to volume); v/v (volume to volume); BSA (bovine 
sman albumin); CTAB (cetyltrimethylammonium bromide); HPLC (high pressure liquid 
chromatography); DNA (deoxyribonucleic acid); p (plasmid); |xl (microliters); ml 
(milliliters); ^g (micrograms); mg (milligrams); M (molar); mM (milliMolar); fiM 
(microMolar); pmoles (picomoles); amoles (attomoles); zmoles (zq)tomoles); 
nm (nanometers); kdal (kilodaltons); OD (optical density); EDTA (ethyl^e diamine 
tetra-acetic acid); FITC (fluorescein isothiocyanate); SDS (sodium dodecyl sul&te); 
NaP04 (sodium phosphate); NP-40 (Nonidet P-40); Tris (tris(hydroxymethyl)- 
aminomethane); PMSF (phenyhnethylsulfonylfluoride); TBE (Tiis-Borate-EDTA, z,e., 
Tris buffer titrated with boric acid rather than HCl and containing EDTA); 
PBS (phosphate buffered saline); PPBS (phosphate buffered saline containing 1 mM 
PMSF); PAGE (polyacrylamide gel electrophoresis); Tween (polyoxyethylene-sorbitan); 
ATCC (American Type Culture Collection, Rockville, MD); Coriell (Coriell Cell 
Repositories, Camden, NJ); DSMZ (Deutsche Sammlung von Mikroorganismen und 
Zellculturen, Braunschweig, Germany); Ambion (Ambion, Inc., Austin, TX); Boehringer 
(Boehringer Maimheim Biochemical, Indianapolis, IN); MJ Research (MJ Research, 
Watertown, MA; Sigma (Sigma Chemical Company, St. Louis, MO); Dynal (Dynal A.S., 
Oslo, Norway); Gull (Gull Laboratories, Salt Lake City, UT); Epicentre (Epicentre 
Technologies, Madison, WI); Lampire (Biological Labs., Inc., Coopersberg, PA); MJ 
Research (MJ Research, Watertown,MA); National Biosciences (National Biosciences, 
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Plymoutb, MN); NEB (New England Biolabs. Beverly. MA); Novagen (Novagen, Inc., 
Madison. WI); ; Proraega (Promega, Corp., Madison. WI); Stratagene (Stratagene 
Cloning Systems, La Jolla, CA); Clonetech (Clonetech, Palo Alto. CA) Phannacia 
(Phaimacia, Piscataway, NJ); Milton Roy (Milton Roy. Rochester. NY); Amersham 
(Amersham International, Chicago, EL); and USB (U.S. Biochemical, Cleveland, OH). 
Glen Research (Glen Research, Sterling, VA); Coriell (Coriell Cell Repositories. 
Camden. NJ); Gentra (Centra, MinneapoUs, MN); Third Wave Technologies (Third 
Wave Technologies, Madison. WI); PerSeptive Biosystems (PerSeptive Biosystems, 
Framington, MA); Microsoft (Microsoft, Redmond, WA); Qiagen (Qiagen, Valencia, 
CA); Molecular Probes (Molecular Probes, Eugene, OR); VWR (VWR Scientific, ); 
Advanced Biotechnologies (Advanced Biotechnologies. INC., Columbia, MD); and 
Prakin Ehner (also known as PE Biosytems and AppUed Biosystems, Foster City, CA). 

EXAMPLE! 
Rapid screening of colonies for 5' nuclease activity 

Hie native 5' nucleases and the enzymes of the present invention can be tested 
directly for a variety of ftmctions. These include, but are not limited to, 5' nuclease 
activity on RNA or DNA targets and background specificity using alternative substrates 
representing structures that may be present in a target detection reaction. Examples of 
nucleic acid molecules having suitable test structures are shown schematically in Figures 
18A-D and Figures 21-24. The screening techniques described below were developed to 
quickly and efficiently characterize 5' nucleases and to detamine whether the new 5' 
nucleases have any improved or desired activities. Enzymes that show improved cycling 
rates on RNA or DNA targets, or that result in reduced target-independent cleavage vaedt 
more thorough investigation. In general, the modified proteins developed by random 
mutagenesis were tested by rapid colony SCTeen on the substrates shown in Figures 1 8A 
and 18B. A rapid protein extraction was then done, and a test of activity on alternative 
structures. (e.g., as shown in Figures 18C-D) was performed using the protein extract. 
Either the initial screen, or further screening and characterization of enzymes for 
improved activity may be performed using other cleavage coiiq)lexes, such as those 
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diagrammed in Figures 21-24. It is not intended that the scope of the invention be limited 
by the particular sequences used to form such test cleavage structures. One skilled in the 
art would understand how to design and create coinparable nucleic acids to form 
analogous structures for rapid screening. 
5 This order of testing may be chosen to reduce the number of tests overall, to save 

time and reagents. The order of testing for enzyme function is not intended to be a 
limitation on the present invention. Those mutants that showed reasonable cycling rates 
with the RNA or DNA targets may then be cultured overnight, and a rapid protein 
extraction done. Alternatively, any subset or all of the cleavage tests may be done at the 
10 same time. 

For convenience, each type of rapid screen may be done on a separate microtiter 
plate. For example, one plate may be set up to test RNA INVADER activity, one plate 
set up to test for DNA INVADER activity. As many as 90 different colonies may be 
screened on one plate. Hie colonies screened can be from a variety of sources, such as 
IS clones of unaltered (native) S* nucleases, from one mutagenesis reaction {e.g. , many 
colonies from a single plate) or from a variety of reactions (colonies selected from 
multiple plates). 

Ideally, positive and negative controls should be run on flie same plate as the 
mutants, using the same preparation of reagents. One example of a good positive control 

20 is a colony containing the unmodified enzyme, or a previously modified enzyme whose 
activity is to be compared to new mutants. For example, if a mutagenesis reaction is 
performed on the Taq DN RX HT construct (described below), the unmodified Taq DN 
RX HT construct would be chosen as the standard for comparing the effects of 
mutagenesis on enzymatic activity. Additional control enzymes may also be incorporated 

25 into the rapid screening test. For example, Tth DN RX HT (described below; unless 
otherwise specified, the TaqPol and TthPol enzymes of the following discussion refer to 
the DN RX HT derivative) may also be included as a standard for enzymatic activity 
along with the Taq DN RX HT. This would allow a comparison of any altered enzymes 
to two known enzymes having different activities. A negative control should also be run 

30 to determine the background reaction levels (i.e., cleavage or probe degradation due to 
sources other than the nucleases being compared). A good negative control colony would 



176 



wo 01/90337 



PCTAJSOl/17086 



be one containing only the vector used in the cloning and mutagenesis, for example, 
colonies containing only the pTrc99A vector. 

Two fectors that may influence the number of colonies chosen from a specific 
mutagenesis reaction for the initial rapid screen are 1) total number of colonies obtained 
from the mutagenesis reaction, and 2) whether the mutagenesis reaction was site-specific 
or randomly distributed across a whole gene or a region of a gene. For example, if only 
5-10 colonies are present on the plate, all colonies can easily be tested. If hundreds of 
colonies are present, a subset of ttiese may be analyzed. GeneraUy 10-20 colonies are 
tested from a site-specific mutagenesis reaction, while 80 to 100 or more colonies are 
routinely tested from a single random mutagenesis reaction. 

Where indicated, the altwed 5' nucleases described in these experimental 
acamples were tested as detaOed below. 

A. Rapid screen: INVADER activity on RNA target (Figure 18A) 

A 2X substrate mix was pr^ared, comprising 20 mM MOPS, pH 7.5, 10 mM 
MgS04, 200 mM KCl, 2 \M FRET-probe oligo SEQ TD NO:223 
(5'-Fl-CGCT-cy3-TCTCGCTCGC-3'), 1 INVADER oUgo SEQ ID NO:224 
(5'-ACGGAACGAGCGTCTTTG-3'), and 4 nM RNA target SEQ ID NO:225 (5'-GCG 
AGC GAGA CAG CGA AAG ACG CUC GUU CCG U-3'). Five pi of the 2X substrate 
mix were dispensed into each sample well of a 96 well microtiter plate (Low Profile 
MULUPLATE 96, M. J. Research, Inc.). 

Cell suspensions were pi^ared by picking single colonies (mutants, positive 
control, and negative control colonies) and suspending each one in 20nl of water. This 
can be done conveniently in a 96 well microtiter plate format, using one well per colony. 

Five jxl of the cell suspension was added to the appropriate test well such that the 
final reaction conditions were 10 mM MOPS, pH 7.5, 5 mM MgS04, 100 mM KCl, 1 pM 
FRET-probe oligo, 0.5 jiM INVADER oligo, and 2 nM RNA target. The weUs were 
covered with 10 pi of Clear CHILLOUT 14 (M.J. Research, Inc.) Uquid wax, and the 
samples were heated at 85°C for 3 minutes, then incubated at 59°C for 1 hour. After the 
incubation, the plates were read on a Cytofluor flourescense plate reader using fee 
following parameters: excitation 485/20, emission 530/30. 
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B. Rapid screen: INVADER activity on DNA target (Figure 18B) 

A 2X substrate mix was prepared, comprising 20 mM MOPS, pH 7.5, 10 mM 
MgS04, 200 mM KCl, 2 \M FRET-probe oligo SEQ ID NO:223 
5 (5'-Fl-CGCT-Cy3-TCTCGCTCGC-30, 1 ^M INVADER oUgo SEQ ID NO:224 
(5*-ACGGAACGAGCGTCTTTG-3*), 1 nM DNA target SEQ ID NO:226 (5'-GCG 
AGC GAGA CAG CGA AAG ACG CTC GTT CCG T-3'). Five \il of the 2X substrate 
mix was dispensed into each sample well of a 96 well microtiter plate (MJ Low Profile). 
Cell suspensions were prepared by picking single colonies (mutants, positive 
10 control and negative control colonies) and suspending them in 20 \xl of water, generally 
in a 96 well microtiter plate format 

5 ^1 of the cell suspension were added to the appropriate test well such that the 
final reaction conditions were 10 mM MOPS, pH 7.5, 5 mM MgS04, 100 mM KCl, 1 \xM 
FRET-probe oligo, 0.5 ^iM INVADER oligo, and 0.5 nM DNA target. Wells were 
15 covered with 10|a 1 of Clear CHILLOUT 14 (MJ. Research, Inc.) liquid wax, and the 
reactions were heated at 85°C for 3 minutes, then incubated at 59°C for 1 hour. After the 
hour incubation, the plate were read on a Cytofluor flourescan plate reader using the 
following parameters: excitation 485/20, emission 530/30, gain 40, reads per well 10. 

20 C. Rapid protein extraction (crude cell lysate) 

Those mutants that gave a positive or an unexpected result in either the RNA or 
DNA INVADER assay were further analyzed, specifically for background activity on the 
X-structure or the hairpin substrate (Figure 18C and D, respectively). A rapid colony 
screen format can be employed, as described above. By simply changing the substrate, 

25 tests for background or aberrant enzymatic activity can be done. Another approach 
would be to do a rapid protein extraction from a small overnight culture of positive 
clones, and then test this crude cell lysate for additional protein fimction. One possible 
rapid protein extraction procedure is detailed below. Two to five ml of LB (containing 
the appropriate antibiotic for plasmid selection; See e.g., Maniatis, books 1,2 and 3) were 

30 inoculated with the remaining volume of the 20 |j,l water-cell suspension and incubated at 
37°C overnight. About 1.4 ml of the culture were transferred to a 1.5 ml microcentrifiige 
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tube, and miciocentrifuged at top speed (e.g.. 14.000 ipm in an Eppendorf 5417 table top 
microcentrifuge), at room temperature for 3-5 minutes to pellet the cells. The supernatant 
was removed, and the ceU peUet was suspended in 100 nl of TES buffer pH 7.5 (Sigma). 
Lysozyme (Promega) was added to a final concentration of O.S^g/nl and samples were 
5 incubated at room temperature for 30 minutes. Samples were then heated at 70°C for 10 
minutes to inactivate the lysozyme, and the cell debris was pelleted by 
microcentrifugation at top speed for 5 minutes. The supernatant was removed and this 
crude cell lysale was used in ttie following enzymatic activity assays. 

10 D. Rapid screen: background specificity X stoTicture substrate (Figure 18C) 
Reactions were performed under conditions as detailed above. One ^1 of crude 
cell lysate was added to 9 of reaction components for a final volume of 10 nl and final 
concentrations of 10 mM MOPS, pH 7.5. 5 mM MgS04, 100 mM KCU 1 pM 
FRET-piobe oligo (SEQ ID N0:223), 0.5nM X-staicture INVADER oligo SEQ ID 

15 N0:227 (5'-ACGGAACGAGCGTCTTTCATCTGTCAATC-3'). and 0.5 nM DNA 
target (SEQ ID NO:226). Wells were covered with 10 \il of Clear CHILLOUT 14 (M.J. 
Research. Inc.) liquid wax, and the reactions were heated at SS'C for 3 minutes, then 
incubated at 59*C for 1 hour. After the incubation, the plates were read on a Cytofiuor 
fluorescence plate reader using the following parameters: excitation 485/20, emission 

20 530/30, gain 40, reads per well 10. 

E. Rapid screen: background specificity hairpin substrate (Figure 18D) 

Reactions were performed under conditions as detailed above. One ^1 of crude 
cell lysate was added to 9 ^1 of reaction components for a final volume of 10 jil and final 
25 concentrations of 10 mM MOPS, pH 7.5. 5 mM MgS04. 100 mM KCl. 1 nM 
FRET-probe oligonucleotide (SEQ ID N0:223), and 0.5 nM DNA target (SEQ ID 
NO:226). WeUs were covered with 10 ^1 of Clear CHILLOUT 14 (M.J. Research, Inc.) 
liquid wax, and the reactions were heated at 85°C for 3 minutes, then incubated at SSX 
for 1 hour. After the hour incubation, the plate were read on a Cytofluor plate reader 
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using the following parameters: excitation 485/20, emission 530/30, gain 40, reads per 
well 10. 

F. Activity assays with IrTl and IdT targets (Figures 24) 

5 The 5* nuclease activities assays were carried out in 10 of a reaction containing 

10 mM MOPS, pH 7.5, 0.05% Tween 20, 0.05% Nonidet P-40, 10 ^ig/ml IRNA, 100 mM 
KCl and 5 mM MgS04. The probe concentration (SBQ K) NO: 167) was 2 mM. The 
substrates QfTl (SEQ ID NO: 228) or IdT (SEQ E) NO: 229) at 10 or 1 nM final 
concentration respectively) and approximately 20 ng of an enzyme, prepared as in 

10 Example 3, were mixed with the above reaction bxifPer and overlaid with CHILLOUT 
(MJ Research) liquid wax. Reactions were brought up to reaction tenq)erature 57 **C, 
started by addition of MgS04, and incubated for 10 min. Reactions were then stopped by 
the addition of 10 of 95% formamide containing 10 mM EDTA and 0.02% methyl 
violet (Sigma). Samples were heated to 90°C for 1 minute immediately before 

15 electrophoresis through a 20% denaturing acrylamide gel (19:1 cross-linked), with 7 M 
urea, and in a buffer of 45 mM Tris-borate, pH 8.3, 1.4 mM EDTA. Unless otherwise 
indicated, 1 nl of each stopped reaction was loaded per lane. Gels were then scanned on 
an FMBIO-100 fluorescent gel scanner (Hitachi) using a 505 nm filter. The firaction of 
cleaved product was deteimined &om intensities of bands corresponding to uncut and cut 

20 substrate with FMBIO Analysis software (version 6.0, Hitachi). The fi-action of cleavage 
product did not exceed 20% to ensure that measurements approximated initial cleavage 
rates. The turnover rate was defined as the number of cleaved signal probes generated 
per target molecule per minute under these reaction conditions (1/min). 

25 G. Activity assays with X structure (X) and hairpin (HP) targets (Figures 22) 
The 5' nuclease activity assays were carried out in 10 |al of a reaction containing 
10 mM MOPS, pH 7.5, 0.05% Tween 20, 0.05% Nonidet P-40, 10 ^ig/ml tRNA, 100 mM 
KCl and 5 mM MgS04. Bach oligo for formation of either the hairpin structure assembly 
(22A, SEQ ID NOS: 230 and 231) assembly or the X stmcture assembly (22B, SEQ ID 

30 NOS: 230-232) was added to a final concentration of 1 urn, and approxunately 20 ng of 
test enzyme prepared as described in Example 3, were mixed with the above reaction 
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buflfer and overlaid with CHILLOUT (MJ Research) Uquid wax. Reactions were brought 
up to reaction temperatiire 60 »C, started by addition of MgS04. and incubated for 10 
min. Reactions were then stopped by the addition of 10 (xl of 95% formamide containmg 
10 mM EDTA and 0.02% methyl violet (Sigma). Samples were heated to 90'C for 1 
minute immediately before electrophoresis through a 20% denaturing acrylamide gel 
(19:1 cross-linked), with 7 M urea, and in a buffer of 45 mM Tris-borate, pH 8.3, 1 .4 mM 
EDTA, Unless otherwise indicated, 1 jU of each stopped reaction was loaded per lane. 
Gels were then scanned on an FMBIO-100 fluorescent gel scanner (Hitachi) using a 505 
ran filter. The fraction of cleaved product was detemiined from intensities of bands 
corresponding to uncut and cut substrate with FMBIO Analysis software (version 6.0, 
Hitachi). The fraction of cleavage product did not exceed 20% to ensure tiiat 
measurements approximated initial cleavage rates. The bunover rate was defined as tiie 
number of cleaved signal probes generated per target molecule per minute under these 
reaction conditions (1/min). 



H. Activity assays with human IL-6 target (Figure 10) 

The 5' nuclease activities assays wwe carried out in 10 ]il reactions containing 10 
mM MOPS, pH 7.5, 0.05% Tween 20, 0.05% Nonidet P-40, 10 jig/ml tRNA, 100 mM 
KCl and 5 mM MgS04. Reactions comprising flie DNA IL-6 substeate contained 0.05 
nM IL-6 DNA target (SEQ ID NO: 163) and 1 pM of each probe (SEQ ID NO: 162) and 
INVADER (SEQ ID NO: 161) oligonucleotides, and were carried out at 60°C for 30 min. 
Reactions comprising the 11^6 RNA target (SEQ ID NO: 160) were performed under the 
same conditions, except that the IL-6 RNA target concentration was I nM and the 
reactions were performed at 57°C for 60 min. Each reaction contained approximately 20 
ng of test enzyme, prepared as described in Example 3. 

L Activity assays with synUicticr25mer target (F^re 23) 

Reactions comprising tiie syntiietic r25mer target (SEQ ID NO: 233) were carried 
out under tiie same reaction conditions (10 mM MOPS, pH 7.5, 0.05% Tween 20, 0.05% 
Nonidet P-40, 10 ng/ml tRNA, 100 mM KCl and 5 mM MgS04) and 1 nM of each probe 
(SEQ ID NO: 234) and INVADER (SEQ ID NO: 235) oligonucleotides, except fliat tiie 
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r25mer target concentration was 5 nM and the reactions were performed at 58°C for 60 
min. Approximately 20 ng of each test enzyme was added to the reactions. Enzymes 
were pr^ared as described in Example 3. 

5 Any of the tests described above can be modified to derive the optimal conditions 

for enzymatic activity. For example, enzyme titrations can be done to determine the 
optimal enzyme concentration for maximum cleavage activity, and lowest background 
signal. By way of example, but not by way of limitation, many of the mutant enzymes 
were tested at 10, 20 and 40 ng amounts. Similarly, a temperature titration can also be 

10 incorporated into the tests. Since modifying the structure of a protein can alter its 

temperature requiremmts, a range of temperatures can be tested to identify the condition 
best suited for the mutant in question. 

Examples of the results fix)m such screens (using q)proximately 20 ng of the 
mutant enzyme) are shown in Tables 3-8, and Figures 12, 14, IS, 19, and 25. 

15 

EXAMPLE 2 

Cloning and Expression of 5' nucleases of DNA polymerases and mutant 

polymerases 

20 A. DNA polymerases of Thermus aquaticus and Thermus thermophilus 
1. Cloning of TaqPol and TthPol 

Type A DNA polymerases from eubacteria of the genus Thermus share extensive 
protein sequence identity (90% in the polymerization domain, using the Lipman-Pearson 

25 method in the DNA analysis software from DNAStar, WI) and behave similarly in both 
polymerization and nuclease assays. Therefore, the genes for die DNA polymerase of 
Thermus aquaticus (TaqPol), Thermus thermophilus (TthPol) and Thermus scotoductus 
were used as representatives of this class. Polymerase genes fit>m other eubacterial 
organisms, including, but not limited to, Escherichia coli. Streptococcus pneumoniae, 

30 Mycobacterium smegmatiSy Thermus thermophilus ^ Thermus sp.j Thermotoga maritima, 
Thermosipho qfricanus^ and Bacillus stearothermophilus are equally suitable. 
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a. Initial TaqPol Isolation: mutant TaqA/G 

The Taq DNA polymerase gene was ampUfied by polymerase chain reaction from 
genomic DNA from Thermus aqmticus, strain YT-1 (Lawyer et al., supra), using as 
primers the oUgonucleotides described in SEQ ID NOS:236 and 237. The resulting 
fragment of DNA has a recognition sequence for the restriction endonuclease EcoBI at 
the 5' end of the coding sequence and a BgM sequence at the 3' end of the coding strand. 
Cleavage wifli BglH leaves a 5' overhang or "sticky end" that is compatible with the end 
generated by Bamm. The PCSl-ampUfied DNA was digested with £coRI and fiamffl. 
The 2512 bp fragment containing the coding region for the polymerase gene was gel 
purified and then Ugated into a plasmid that contains an inducible promoter. 

In one embodiment of the invention, the pTTQlS vector, which contains the 
hybrid trp-lac (tac) promoter, was used (M.J.R. Stark, Gene 5:255 [1987]). The tac 
promoter is under the control of the E. coli lac repressor protein. Repression allows the 
synthesis of the gene product to be suppressed until the desired level of bacterial growth 
has been achieved, at which point repression is removed by addition of a specific inducer, 
isopropyl-b-D-thiogalactopyranoside (IPTG). Such a system allows the controlled 
expression of foreign proteins that may slow or prevent growth of transformants. 

Particularly strong bactaial promoters, such as the synthetic ?tac, may not be 
adequately suppressed when present on a multiple copy plasmid. If a higihly toxic protein 
is placed under control of such a promoter, the small amount of expression leaking 
through, even in the absence of an inducer, can be harmful to die bacteria. In another 
embodiment of the invention, another option for repressing synthesis of a cloned gene 
product is contemplated. A non-bacterial promoter from bacteriophage T7, found in the 
plasmid vector series pET-3, was used to express the cloned mutant Taq polymerase 
genes (Studier and Mofifatt, J. Mol. Biol., 189:113 [1986]). This promoter initiates 
transcription only by T7 RNA polymerase. In a suitable strain, such as 
BL21(DE3)pLYS, the gene for the phage T7 RNA polymerase is carried on the bacterial 
genome under control of the lac operator. This arrangement has the advantage that 
expression of the multiple copy gene (on tiie plasmid) is completely dependent on the 



183 



wo 01/90337 



PCT/USOl/17086 



expression of T7 RNA polymerase, which is easily suppressed because it is present in a 
single copy. 

These are just two examples of vectors having suitable inducible promoters. 
Others are well known to those skilled in the art, and it is not intended tiiat the improved 

5 nucleases of the present invention be limited by the choice of expression system^ 

For ligation into the pTTQ18 vector, the PGR product DNA containing the Tag 
polymerase coding region (termed malTaq for reasons discussed below, SEQ ED 
NO:238) was digested with EcoRI and BglU and this fragment was ligated under 
standard "sticky end" conditions (Sambrook et al Molecular Clonings Cold Spring 

10 Harbor Laboratory Press, Cold Spring Harbor, pp. 1 .63-1 .69 [1989]) into the EcdSl and 
jBflmHI sites of the plasmid vector pTTQlS. Expression of this construct yields a 
translational fusion product in which flie first two residues of the native protein 
(Met-Arg) are replaced by three from the vector (Met-Asn-Ser), but the remainder of the 
PGR product's protein sequence is not changed (SEQ ID NO:239), The construct was 

15 transformed into the JM109 strain of E. coli, and the transformants were plated under 
incompletely repressing conditions that do not pennit growth of bactCTa expressing the 
native protein. These plating conditions allow the isolation of genes containing 
pre-existing mutations, such as those that result from the infidelity of Tag polymerase 
during the amplification process. 

20 Using this amplification/selection protocol, a clone was isolated containing a 

mutated Tag polymerase gene (amtTag). The mutant was first detected by its phenotype, 
in which temperature-stable 5' nuclease activity in a crude cell extract was normal, but 
polymerization activity was almost absent (approximately less than 1% of wild type Tag 
polymerase activity). Polymerase activity was detmnined by primer extension reactions. 

25 The reactions were carried out in 10 (tl of buffer containing 10 mM MOPS, pH 7.5, 5 
mM MgS04, 100 mM KCl. In each reaction, 40 ng of ^izyme were used to extend 10 
MM (dT)25-30 primer in the preesnce of either 10 |iM poly (A)286 or 1 ^iM poly (dA)273 
template, 45 |xM dTTP and 5 jiM Fl-dUTP at 60°C for 30 minutes. Reactions were 
stopped witii 10 ^il of stop solution (95% formamide, 10 mM EDTA, 0.02% methyl violet 

30 dye). Samples (3 |li1) were fractionated on a 15% denaturing acrylamide gel (19:1 
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crossed-Iinked) and the fiaction of incorporated Fl-dUTP was quantitated using an 
FMBIO-100 fluorescence gel scanner (Hitachi) equipped with a 505 nm emission filter. 

DNA sequence analysis of the recombinant gene showed that it had changes in 
the polymerase domain resulting in two amino acid substitutions: an A to G change at 
nucleotide position 1394, which causes a Glu to Gly change at amino acid position 465 
(numbered according to the natural nucleic and amino acid sequences, SEQ ID NOS:153 
and 157), and another A to G change at nucleotide position 2260, which causes a Gto to 
Arg change at amino acid position 754. Because the Ghi to Gly mutation is at a 
nonconserved position and because the Glu to Arg mutation alters an amino acid that is 
conserved in virtually all of the known Type A polymerases, the latter mutation is most 
likely the one responsiT)le for curtailing the synthesis activity of this protein. The 
nucleotide sequence for the construct is given in SEQ ID NO:39. The enzyme encoded 
by this sequence is refened to as Taq A/G. 

b. Initial TthPol Isolation 

The DNA polymerase enzyme from the bacterial species Themus themophilus 
(Tth) was produced by cloning the gene for this protein into an expression vector and 
overproducing it in E. coll cells. Genomic DNA was prepared from 1 vial of dried 
Themus themophilus strain HB-8 from ATCC (ATCC #27634). THe DNA polymerase 
gene was amplified by PCR using ttie following primers: 
5'-CACGAATTCCGAGGCGATGCTTCCGCTC-3' (SEQ ID NO:240) and 
5'-TCGACGTCGACTAACCCTTGGCGGAAAGCC-3' (SEQ ID NO:241). The 
resulting PCR product was digested with EcoSl and Satt. restriction endonucleases and 
inserted into EcdSJ/Sal I digested plasmid vector pTrc99G (described in Example 2C1) 
to create the plasmid pTrcTth-1. This Tth polymerase construct is missing a single 
nucleotide that was inadvertently omitted from the 5' oUgonucleotide, resulting in the 
polymerase gene being out of frame. This mistake was corrected by site specific 
mutagenesis of pTrcTth-1 as described in Examples 4 and 5 using the following 
oligonucleotide: 5'.GCATCGCCTCGGAATTCATGGTC-3' (SEQ ID NO:242), to create 
the plasmid pTrcTth-2. The protein and the nucleic acid sequence encoding the protein 
are referred to as TthPol, and are Usted as SEQ ID NOS:243 and 244 respectively. 
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€. Large Scale preparation of recombinant proteins 
The recombinant proteins were purified by the following technique which is 
daived fix>m a Taq DNA polymerase preparation protocol (Engelke et aly Anal. 

5 Biochem., 191:396 [1990]) as follows. E, coli cells (strain JM109) containing either 
pTrc99A TaqPol, pTrc99GTthPol were inoculated into 3 ml of LB containing 100 mg/ml 
ampicillin and grown for 16 hrs at 'iTC. The entire ovomight culture was inoculated into 
200 ml or 350 ml of LB containing 100 mg/ml ampicillin and grown at 37^*0 wifli 
vigorous shaking to an Aeoo of 0.8. DPTG (1 M stock solution) was added to a final 

10 concentration of 1 mM and growth was continued for 16 hrs at 37°C. 

The induced cells were pelleted and the cell pellet was weighed. An equal 
volume of 2X DG buffer (100 mM Tris-HCl, pH 7,6, 0.1 mM EDTA) was added and the 
pellet was suspended by agitation. Fifty mg/ml lysozyme (Sigma) were added to 1 
mg/ml final concentration and the cells incubated at room temperature for 15 min. 

15 Deoxycholic acid (10% solution) was added dropwise to a final concentration of 0.2 % 
while vortexing. One volume of H2O and 1 volume of 2X DG buffer were added, and the 
resulting mixture was sonicated for 2 minutes on ice to reduce the viscosity of the 
mixture. After sonication, 3 M (NH4)2S04 was added to a final concentration of 0.2 M, 
and the lysate was centrifuged at 14000 x g for 20 min at 4®C. The supernatant was 

20 removed and incubated at 70°C for 60 min at which time 10% polyethylimine (PEI) was 
added to 0.25%. After incubation on ice for 30 min., the mixture was centrifiiged at 
14,000 X g for 20 min at 4*^0. At this point, the supernatant was removed and the protein 
precipitated by the addition of (NH4)2S04 as follows. 

Two volumes of 3 M (NH4)2S04 were added to precipitate the protein. The 

25 mixture was incubated overnight at room temperature for 16 hrs centriftiged at 14,000 x g 
for 20 min at 4°C, The protein pellet was suspended in 0.5 ml of Q buffer (50 mM 
Tris-HCl, pH 8.0, 0.1 mM EDTA, 0.1% Tween 20). For the Mja FEN-1 preparation, 
solid (NH4)2S04 was added to a final concentration of 3 M (--75% saturated), the mixture 
was incubated on ice for 30 min, and the protein was spun down and suspended as 

30 described above. 
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The suspended ptotein preparations were quantitated by determination of the A279 
dialyzed and stored in 50% glycerol. 20 mM Tris HCl. pH8.0. 50 mM KCl. 0.5% Tween 
20, 0.5% Nonidet P-40, with 100 jig/ml BSA. 

B. DNA polymerases of Thermm fliiformis and Thermus scoioductus 
1. Cloning of Thermus flttfomis and Thermus scotoductus 
One vial of lyopWUzed Thermus filiformis (Tfi) obtained from DSMZ (Deutsche 
Sanmilung von Mikroorganismen und Zellculhiren, Braunschweig, Germany, strain 
#4687) was rehydiated in 1 ml of Castaiholz medium (DSMZ medium 86) and 
inoculated into 500 ml of Castenholz medium preheated to 50''C. The culture was 
incubated at TO'C with vigorous shaking for 48 hours. After growth, the ceUs were 
harvested by centrifogation at 8000 x g for 10 minutes, the cell peUet was suspended in 
10 ml of TE (10 mM TrisHCL, pH 8.0, 1 mM EDTA), and the cells were frozen at -20''C 
in 1 ml aUquots. A 1 ml ahquot was thawed, lysozyme was added to 1 mg/ml, and the 
cells were incubated at 23»C for 30 mmutes. A solution of 20% SDS (sodium dodecyl 
sulfite) was added to a final concentration of 0.5% followed by extraction with buffered 
phenol. The aqueous phase was fiirflier extracted with 1:1 phcnol:dilorofonn, and 
exbacted a final time with chloroform. One-tenth volume of 3 M sodium acetate, pH 5.0 
and 2.5 volumes of ethanol wae added to tiie aqueous phase and mixed. The DNA was 
pelleted by caitrifiigation at 20,000 x g for 5 minutes. The DNA pellet was washed with 
70% ethanol, air dried and resuspended in 200 jil of TE and used directly for 
amplification. Thermus scotoductus (Tsc, ATCC # 51532) was grown and genomic DNA 
was pr^ared as described above for Thermus filiformis. 

The DNA polymerase I gene 60m Tfi (GenBank accession #AF030320) could not 
be amplified as a single fi^gment Therefore, it was cloned in 2 separate fiagments into 
the expression vector pTrc99a. The 2 fragments overlap and share a Not I site which was 
created by introducing a silent mutation at position 1308 of the Tfi DNA polymerase open 
reading frame (ORF) in the PGR oligonucleotides. The 3' half of the gene was ampUfied 
using the Advantage cDNA PGR kit (Glonetech) witii the following oUgonucleotides; 
5'-ATAGCCATGGTGGAGCGGCGGCTCTCCCGG (SEQ ID NO:245) and 
5'-AAGCGTCGAGTCAATCCTGCTTCGCCTCCAGCC (SEQ ID NO:246). The PGR 
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product from this reaction was ^proximately 1200 base pairs in length. It was cut with 
the restriction en2ymes Not I and Sal I, and the resulting DNA was Ugated into pTrc99a 
cut with NotI and Sail to create pTrc99a-Tfi3\ The 5* half of the gene was amplified as 
described above using the following two primers; 

5 S'AATCGAATTCACCCCACnTrTTGACCTGGAGG (SEQ ID NO:247) and 

5»-CCGGGAGAGCGGCCGCTCCAC (SEQ ID NO:248). The resulting 1300 base pair 
fragment was cut with restriction enzymes Eco RI and Not I and Ugated into 
pTrc99a-Tfi3* cut with Notl and EcdSl to produce pTrc99a-TfiPol, SEQ ID NO:249 (the 
corresponding amino acid sequence is listed in SEQ ID NO:250). 

10 The DNA polymerase I gene from Thermus scotoductus was anq)lified using the 

Advantage cDNA PGR kit (Clonetech) using the following two primers; 
5*-ACTGGAATTCCTGCCCCTCTTTGAGCCCAAG (SEQ ID NO:251) and 
5'-AACAGTCGACCTAGGCCTTGGCGGAAAGCC (SEQ ID NO:252). The PGR 
product was cut with restriction enzymes Eco RI and Sal I and Ugated into Eco RI, Sal I 

15 cut pTrc99a to create pTrc99a-TscPol SEQ ID NO:253 (the corresponding amino acid 
sequence is Usted in SEQ ID NO:2S4). 

2. Expression and purification of Thermus flUformis and Thermus 
scotoductus 

20 Plasmids were transformed into protease deficient E, coli strain 6L21 (Novagen) 

or strain JM109 (Promega Corp., Madison, WI) for protein expression. Flasks containing 
200 ml of LB containing 100 M.g/ml ampicilUn were inoculated with eitiher a single 
colony from an LB plate or from a frozen stock of the appropriate strain. Aft^ several 
hours of growth at 37^0 with vigorous shaking, cultures was induced by the addition of 

25 200 ^il of 1 M isothiopropyl-galatoside (IPTG). Growth at 37**C was continued for 16 
hours prior to harvest Cells were pelleted by centrifiigation at 8000 x g for IS minutes 
followed by suspension of the cell pellet in 5 ml of TEN (10 mM TrisHCl, pH 8.0, 1 mM 
EDTA, 100 mM NaCl). 100 |il of 50 mg/ml lysozyme were added and the ceUs 
incubated at room temperature for 15 minutes. DeoxychoUc acid (10%) was added to a 

30 final concentration of 0.2%. After thorough mixing, the cell lysates were sonicated for 2 
minutes on ice to reduce the viscosity of the mixture. CeUular debris was pelleted by 
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centrifugation at 4°C for 15 minutes at 20,000 x g. The supernatant was removed and 
incubated at VCC for 30 min after which 10% polyethylimine (PEI) was added to 0.25%. 
After incubation on ice for 30 minutes, the mixture was centrifiiged at 20,000 x g for 20 
min at 4''C. At this point, the siq>anatant containing the raizyme was removed, and the 
protdn was precipitated by Ihe addition of 1 .2 g of ammonium sulfate and incubation at 
4"'C for I hour. The protein was pelleted by centrifugation at 4''C for 10 minutes at 
20,000 X g. The pellet was resuqjoided in 4 ml of HPLC Buffer A (50 mM TrisHCl, pH 
8.0, 1 mM BDTA). The protean was fiuther purified by affinity chromatography using an 
Econo-Pac heparin cartridge (Bio-Rad) and a Dionex DX 500 HPLC instrument. Briefly, 
the cartridge was equiUbrated with HPLC Buffer A, and the enzyme extract was loaded 
on the column and eluted with a linear gradient of NaCl (0-2 M) in the same buffer. Pure 
protein elutes between 0.5 and 1 M NaCL The enzyme peak was coUected and dialyzed 
m 50% glycerol. 20 mM Tris HCl, pH 8, 50 mM KCl, 0.5% Tween 20, 0.5% Nonidet 
P40,100mg/mlBSA. 



C. Generation of potymerase mutants with reduced polymerase activity but 
unaltered 5* nuclease activity 

All mutants generated in section C were expressed and purified as described in 
Example 2A1C. 

1. Modified TaqPol Genes: TaqDN 

A polymerization deficient mutant of Tag DNA polymerase called TaqDN was 
constructed. TaqDN nuclease contains an asparagine residue in place of the wild-type 
aspartic acid residue at position 785 (D785N). 

DNA encoding the TaqDN nuclease was constructed firom ftie gene encoding the 
Taq A/G in two rounds of site-directed mutagaiesis. First, tiie G at position 1397 and the 
G at position 2264 of the Taq A/G gene (SEQ ID NO:238) were changed to A at each 
position to recreate a wild-type TaqPol gene. In a second round of mutagenesis, tiie wild 
type TaqPol gene was convarted to the Taq DN gene by changing the G at position 2356 
to A. These manipulations were performed as follows. 
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DNA encoding the Taq A/G nuclease was recloned from pTTQlS plasmid into 
the pTrc99A plasmid (Phannacia) in a two-step procedure. First, the pTrc99A vector 
was modified by removing the G at position 270 of the pTrc99A map, creating the 
pTrc99G cloning vector. To this end, pTrc99A plasmid DNA was cut with Ncol and the 

5 recessive 3' ends were fiUed-in using the Klenow fragment ofE. coli polymerase I in the 
presence of all four dNTPs at 3TC for 15 min. After inactivation of the Klenow 
fragment by incubation at SS^'C for 10 min, Qie plasmid DNA was cut with EcoBl and the 
ends were again filled-in using the Klenow fragment in the presence of all four dNTPs at 
ZTC for 15 min. The Klenow fragment was then inactivated by incubation at 6S^C for 

10 10 min. The plasmid DNA was efhanol precipitated, recircularized by ligation, and usQd 
to transform Exoli JM109 cells (Promega). Plasmid DNA was isolated from single 
colonies, and deletion of the G at position 270 of the pTrc99A map was confirmed by 
DNA sequencing. 

In a second step, DNA encoding the Taq A/G nuclease was removed from the 
15 pTTQlS plasmid using EcdSl and Sail and the DNA fragmrat carrying the Taq A/G 
nuclease gene was separated on a 1% agarose gel and isolated with Geneclean n Kit (Bio 
101, Vista, CA). The purified fragment was ligated mto the pTrc99G vector that had 
. been cut with £coRI and iSa/I. The ligation mixture was used to transform competent 
Exoli JM109 cells (Promega). Plasmid DNA was isolated from single colonies and 
20 insertion of the Taq A/G nuclease gene was confirmed by restriction analysis using 
i?coRIand>S'a/I. 

Plasmid DNA pTrcAG carrying the Taq A/G nuclease gene cloned into the 
pTrc99A vector was purified from 200 ml of JM109 overnight culture using QIAGEN 
Plasmid Maxi kit (QIAGEN, Chatsworth, CA) according to manufacturer's protocol. 

25 pTrcAG plasmid DNA was mutagenized using two mutagenic primers, E465 (SEQ ID 
NO:255) (Integrated DNA Technologies, Iowa) and R754Q (SEQ ID NO:256) 
(Integrated DNA Technologies), and the selection primer Trans Oligonucleotide 
AlwNI/Spel (Clontech, Palo Alto, CA, catalog #6488-1) according to TRANSFORMER 
Site-Directed Mutagenesis Kit protocol (Clontech, Palo Alto, CA) to produce a restored 

30 wild-type TaqPol gene (pTrcWT). 
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pTrcWT plastnid DNA carrying the wild-type TaqPol gene cloned into the 
pTrc99A vector was purified from 200 ml of JM109 overnight culture using QIAGEN 
Plasmid Maxi kit (QIAGEN, Chatsworth, CA) according to manufacturer's protocol. 
pTrcWT was then mutagenized using the mutagenic primer D785N (SEQ ID NO:257) 
(Ihtegrated DNA Technologies) and the selection primer Switch OUgonucleotide 
SpeVAIwNI (Clontech, Palo Alto, CA, catalog #6373-1) according to TRANSFORMER 
Site-Diiected Mutagenesis Kit protocol (Clontech, Palo Alto, CA) to create a plasmid 
containing DNA encoding the Taq DN nuclease. The DNA sequence encoding the Taq 
DN nuclease is provided in SEQ ID NO:258; the amino acid sequence of Taq DN 
nuclease is provided m SEQ ID NO:259. 

2. Modified TthPol Gene: TthDN 

The Tth DN construct was created by mutating the TthPol described above. The 
sequence encoding an aspartic acid at position 787 was changed by site-specific 
mutagraiesis as described above to a sequence encoding asparagine. Mutagenesis of 
pTrcTth-2 with tiie following oligonucleotide: 

5'-CAGGAGGAGCTCGTTGTGGACCTGGA-3* (SEQ ID NO:260) was performed to 
create the plasmid pTrcTthDN. The mutant protein and protein coding nucleic acid 
sequence is termed TthDN SEQ ID NOS:261 and 262 respectively. 

3. TaqDNHTandTthDNHT 

Six amino acid histidine tags (his-t^) were added onto the caiboxy teraiini of 
Taq DN and Tth DN. The site-directed mutagenesis was performed using the 
TRANSFORMER Site Directed Mutagmesis Kit (Clontech) according to the 
manufecturer's instructions. The mutagenic oUgonucleotides used on the plasmids pTaq 
DN and pTth DN were sequence 1 17-067-03, 

5'-TCTAGAGGATCTATCAGTGGTGGTGGTGGTGGTGCTCCTTGGCGGAGAGC- 
3'(SEQIDNO:263)and 

S'-TGCCTGCAGGTCQACGCTAGCTAGTGGTGGTGGTGGTGGTGACCCTTGGCG 
GAAAGCC-3' (SEQ ID NO:264), sequence 136-037-05. The selection primer Trans 
Oligo AlwNI/Spel (Clontech, catalog # 6488-1) was used for both mutagenesis reactions. 
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The resulting mutant genes were termed Taq DN HT (SEQ ID NO:265, nucleic acid 
sequence; SEQ ID NO:266, amino acid sequence) and Tth DN HT (SEQ ID NO:267, 
nucleic acid sequence; SEQ ID NO:268, amino acid sequence). 

5 4. Purification of Taq DN HT and Tth DN HT 

Both Taq DN HT and Tth DN HT proteins were expressed in E. coli strain JM109 
as described in Example 2B2. After ammonium sulfate precipitation and centrifiigation, 
the protein pellet was suspended in 0.5 ml of Q buffer (50 mM Tris-HCl, pH 8.0, 0.1 mM 
EDTAm 0.1% Twem 20). The protein was further purified by afiSnity chromatography 

10 using His-Bind Resin and Buffer Kit (Novagen) according to the manufacturer's 
instructions. 1 ml of His-Bind resin was transferred into a column, washed with 3 
column volumes of sterile water, charged with 5 volumes of IX Charge Buffer, and 
equilibrated with 3 volumes of IX Binding Buffer. Four ml of IX Binding Buffer was 
added to the protein sample and the sample solution was loaded onto the column. After 

15 washing with 3 ml of IX Binding Buffer and 3 ml of IX Wash Buffer, the bound His-Tag 
protein was eluted with 1 ml of IX Elute Buffer. The pure enzyme was then dialyzed in 
50% glycerol, 20 mM Tris-HCl, pH 8.0, 50 mM KCl, 0.5% Tween 20, 0.5% Nonidet 
P40, and 100 |ig.nil BSA. En2yme concentrations were determined by measuring 
absorption at 279 mn. 

20 

EXAMPLES 

RNA-dependent 5* nuclease activity of TthPol can be conferred on TaqPol by 
transfer of the N-terminal portion of the DNA polymerase domain 

25 

A. Preparation and purification of substrate structures having either a DNA or 
an RNA target strand 

The downstream (SEQ ID NO:162) and upstream probes (SEQ ID N0:161) and 
the IL-6 DNA (SEQ ID NO: 163) (Figure 10) target strand were synthesized on a 
30 PerSeptive Biosystems instrument using standard phosphoramidite chemistry (Glen 

Research). The synthetic KNA-DNA chimeric IrT target labeled with biotin at the 5'-end 
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(Figure 20A) was synthesized utilizing 2'-ACE RNA chemistry (Dhannacon Research). 
The 2'-protecting groups were removed by acid-catalyzed hydrolysis according to the 
manufecturer's instructions. The downstream probes labeled with 5'-fluorescein (Fl) or 
5'-tetrachloTO-fluorescein (TBT) at their 5' ends were purified by reverse phase HPLC 
using a Resource Q column (Amershara-Phaimacia Biotech). The 648-nucleotide TL-6 
RNA target (SEQ ED NO:160) (Figure 10) was synthesized by T7 RNA polymerase 
runoff-transcription of the cloned firagment of human IL-6 cDNA (nucleotides 64-691 of 
the sequence published in May a/., Proc. Natl. Acad. Sci.. 83:8957 [1986]) using a 
Megascript Kit (Ambion). All oligonucleotides were finally purified by separation on a 
20% denaturing polyaciylamide gel followed by excision and elution of the major band. 
Oligonucleotide concentration was determined by measuring absorption at 260 nm. The 
biotin labeled IrT target was incubated with a 5-fold excess of streptavidin (Promega) in a 
buffer containing 10 mM MOPS, pH 7.5. 0.05% Tween 20. 0.05% NP-40 and 10 ng/ml 
tRNA at room temperature for 10 min. 

B. Introduction of restriction sites to make cliimeras 

The restriction sites used for fonnation of chimerical proteins, described below, 
were chosen for convenience. The restriction sites in the foUowing example have been 
strategically placed to surround regions shown by crystal structure and other analysis to 
be fimctional domains (See, Figures 6, 7, and 19). Different sites, either naturally 
occurring or raeated via directed mutagenesis can be used to make similar constructs with 
other Type A polymerase genes from related organisms. It is desirable that the mutations 
all be silent with respect to protein fimction. By studying the nucleic acid sequence and 
the amino acid sequence of the protein, one can introduce changes in the nucleic acid 
sequence that have no effect on the corresponding amino acid sequence. If the nucleic 
acid change required affects an amino add. one can make tiie alteration such that the new 
amino acid has the same or similar characteristics of the one replaced. If neither of these 
options is possible, one can test the mutant enzyme for fimction to determine if the 
nucleic acid alteration caused a change in protein activity, specificity or fimction. It is 
not mtended that the invention be limited by the particular restriction sites selected or 
introduced for the creation of the improved enzymes of flie present invention. 
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C. Generation of Tth DN RX HT and Taq DN RX HT 

Mutagenesis was performed to introduce 3 additional, unique restriction sites into 
the polymerase domain of both the Taq DN HT and Tth DN HT enzymes. Site-specific 
5 mutagenesis was performed using the Transformer Site-Directed Mutagenesis Kit from 
(Clonetech) according to manufacturer's instractions. One of two different selection 
primers. Trans Oligo AlwNI/Spel or Switch OUgo Spel/AlwNI (Clontech, Palo Alto CA 
catalog #6488-1 or catalog #6373-1) was used for all mutagenesis reactions described. 
The selection oligo used in a given reaction is dependent on the selection restriction site 

10 present in the vector. All mutagenic primers were synthesized by standard synthetic 
chemistry. Resultant colonies were expressed in E.coli strain JM109. 

The Not I sites (amino acid position 328) were created using the mutagenic 
primers S'-gccgccaggggcggccgcgtccaccgggcc (SEQ ID NO:269) and 
5'-gcctgcaggggcggccgcgtgcaccggggca (SEQ ID NO:270) corresponding to the sense 

15 strands of the Taq DN HT and the Tth DN HT genes, req)ectively. The BstI (amino acid 
position 382) and Ndel (amino acid position 443) sites were introduced into both genes 
using sense strand mutagenic primes 5'-ctcctggacccttcgaacaccacccc (SEQ ID NO:271) 
and 5'-gtcctggcccatatggaggccac (SEQ ID NO:272). The mutant plasmids were 
over-expressed and purified using Qiagen QiaPrep Spin Mini Prep Kit (cat. # 27106). 

20 The vectors were tested for the presence of the restriction sites by DNA sequencing and 
restriction mapping. These constructs are termed Tth DN RX HT (DNA sequence SEQ 
ID NO:273; amino acid sequence SEQ ID NO:274) and Taq DN RX HT (DNA sequence 
SEQ ID NO:275; amino acid sequence SEQ ID NO:276). 

2S D« Chimeras 

The chimeric constructs shown in Figure 19 were created by exchanging 
homologous DNA fragments defined by the restriction endonuclease sites EcoRI (E) and 
BamHI (B), conamon for both genes, the cloning vector site Sail (S) and the new sites, 
NotI (N), BsfBI (Bs), Ndel (D) created at the homologous positions of both genes by site 

30 directed mutagenesis. In generating these chimeric enzymes, two different pieces of 
DNA are ligated together to yield the final construct. The larger piece of DNA that 
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contains the plasmid vector as well as part of the Taq or Tth (or parts of both) sequence 
will be termed the 'Srector." The smaller piece of DNA that contains sequences of either 
flie Taq or Tth (or parte of both) polymerase will be termed the "insert" 

AU restriction enzymes were &om New England Biolabs ox Promega and used in 
reactions with the accompanying buffer, according to the manufacturer's instmctions. 
Reactions were done in 20 jil volume with about 500 ng of DNA per reaction, at the 
optimal temperature for ttie specified enzyme. More than one enzyme was used in a 
single reaction (double digest) if flie enzymes were compatible witii respect to reaction 
buffer conditions and reaction temperature. If the enzymes in question were not 
compatible with respect to buffer conditions, the enzyme requiring the lowest salt 
condition was used first After the completion of that reaction, buffer conditions were 
changed to be optimal or better sutted to the second enzyme, and the second reaction was 
performed. These are common restriction enzyme digest strategies, well known to tiiose 
in the art of basic molecular biology (Maniatis, stq>ra). 

The digested restiiction fragments were gel isolated for optimal ligation 
efficiency. Two fil of lOX loading dye (50% glycerol. IX TAB, 0.5% bromophenol 
blue) were added to the 20 \d reaction. The entire volume was loaded and run on a 1%, 
IX TAB agarose gel containing 1 jil of a 1% ethidium bromide solution per 100 ml of 
agarose gel solution. Hie digested fragments were visualized under UV light, and the 
appropriate fragments (as detennined by size) were excised from the gel. These 
fragments were tiien purified using fbe Qiagen Gel Extractio Kit, (cat # 28706) according 
to tiie manufecturer's instructions. 

Ligations were performed in a 10 fil volume, using 400 units per reaction of T4 
DNA Ligase enzyme from New Bigland Biolabs (catalog #202L), witii the 
accompanying reaction buffer. Ligation reactions were done at room temperature for 1 
hour, with 1 fxl of each of tiie Qiagen-purified fragments (approximately 20-50 ng of each 
DNA, depending on recovery from the gel isolation). Ligation products were to 
transfoiroed into E. coli strain JM 109 and plated onto an appropriate growtii and 
selection medium, such as LB with 1 OOfig/ml of ampicillin to select for tiansformants. 

For each ligation reaction, six transfomiants were tested to deteranne if flie 
desired construct was present Plasmid DNA was purified and isolated using flie QiaPrep 
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Spin Mini Prep Kit, according to manufacturer's instructions. The constructs were 
verified by DNA sequencing and by restriction moping. 

Expression and purification of the chimeric enzymes was done as follows. 
Plasmids were transfonned into E. coli strain JM109 (Promega). Log phase cultures (200 

5 ml) of JM109 were induced with 0.5 mM IPTG (Promega) and grown for an additional 
16 hours prior to harvest. Crude extracts containing soluble proteins were prepared by 
lysis of pelleted cells in 5 ml of 10 mM Tris-HCl, pH 8.3, ImM EDTA, 0.5mg/ml 
lysozyme during incubation at room temperature for 15 minutes. The lysate was mixed 
with 5 ml of 10 mM Tris-HCl pH 7.8, 50 mM KCl, 1 mM EDTA, 0.5% Tween 20, 0.5% 

10 Nonidet P-40, heated at 72®C for 30 minutes, and cell debris was removed by 

centrifiigation at 12,000x g for 5 minutes. Final purification of the protein was done by 
affinity chromatograpy using an Econo-Pac heparin cartridge (Bio-Rad) and Dionex DX 
500 HPLC instrument Briefly, the cartridge was equilibrated with SOmM Tris-HCl pH 
. 8, 1 mM EDTA, and an enzyme extract dialyzed against the same buffer was loaded on 

15 ttie column and eluted with a linear gradient of NaCl (0-2 M) in the same buffer. The 
HPLC-purified protein was dialyzed and stored in 50% (vol/vol) glycerol, 20 mM 
Tris-HCl pH 8.0, 50 mM KCl, 0.5% Tween 20, 0.5% Nonidet P.40. and lOO^g/m BSA. 
The enzymes were purified to homogeneity according to SDS-PAGE, and the enzyme 
concentrations were determined by measuring absorption at 279 ma. 

20 

1. Construction of TaqTth(N) and TthTaq(N) 
The first exchange that was performed involved the polymerase domains of the 
two enzymes. Separation of the nuclease domain (the N-terminal end of the protein) 
fix>m the polymerase domain (the C-terminal portion of the protein) was accomplished by 

25 cutting both genes with the restriction endonucleases EcoRI and Notl. The 

approximately 900 base pair firagment from the Tth DN RX HT gene was cloned into the 
homologous sites of the Taq DN RX HT gene, and the approximately 900 base pair 
fiagment from the Taq DN RX HT gene was cloned into the homologous sites of Qie Tth 
DN RX HT gene, yielding two chimeras, TaqTth(N) (DNA sequence SEQ ID NO:69; 

30 amino acid sequence SEQ ID N0:2) which has the Taq DN RX HT 5* nuclease domain 
and the Tth DN RX HT polymerase domain, and TthTaq(N) (DNA sequence SEQ ID 
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NO:70; amino acid sequence SEQ ID N0:3) which is made up of the Tth DN RX HT 5' 
nuclease domain and the Taq DN RX HT polymerase domain. 



2. Construction of TaqTth(N-B) 

5 The Taq DN RX HT constract was cut with the enzymes Ndel and BamHI and 

fee larger, vector fragment was gel isolated as detailed above. The Tth DN RX HT 
construct was also cut with Ndel and BamHI and the smaller (approximately 795 base 
pairs) Tth fragment was gel isolated and purified. The Tth Ndel-BamHI insert was 
ligated into flie Taq Ndel-BamHI vector as detailed above to generate the TaqTth(N-B) 

10 (DNA sequence SEQ ID N0:71 ; amino acid sequence SEQ ID N0:4). 

3. Construction of TaqTth(B-S) 

The Taq DN RX HT constract was cut with the enzymes BamHI and Sail and the 
larger vector fragment was gel isolated as detailed above. The Tth DN RX HT construct 
15 was also cut wiih BamHI and SaH and the smaller (approximately 741 base pairs) Tth 
fragment was gel isolated and purified. The Tth BamHI-Sall insert was ligated into the 
Taq BamHI-Sall vector as detailed above to generate the TaqTth(B-S) (DNA sequence 
SEQ ID NO:72; amino acid sequence SEQ ID N0:5). 

20 4. ConstructloB of TaqTth(N-D) 

The Taq DN RX HT constract was cut with the enzymes NotI and Ndel and the 
larger vector fragment was isolated as detailed above. The Tth DN RX HT constract was 
also cut with NotI and Ndd and the smalla: (approximately 345 base pairs) Tth fragment 
was gel isolated and purified. The Tth Notl-Ndel insert was ligated into the Taq 
25 Notl-Ndel vector as detailed above to generate the TaqTth(N-D) (DNA sequence SEQ ID 
NO:73; amino acid sequence SEQ ID N0:6). 

5. Construction of TaqTthCD-B) 

The Taq DN RX HT construct was cut with the enzymes Ndel and BamHI and 
30 the larger vector fragment was isolated as detailed above. The Tth DN RX HT constract 
was also cut with Ndel and BamHI and the smallar (approximately 450 base pairs) Tth 



197 



wo 01/90337 



PCTAJSOl/17086 



fragment was gel isolated and purified The Tth Ndel-BamHI insert was ligated into the 
Taq Ndel-BamHI vector as detailed above to generate the TaqTth(D-B) (DNA sequence 
SEQ ID NO:74; amino acid sequence SEQ ID N0:7). 

5 6. Construction of TaqTth(Bs-B) 

The Taq DN RX HT construct was cut with the enzymes BstBI and BamHI and 
the larger vector fragment was isolated as detailed above. The Tth DN RX HT construct 
was also cut with BstBI and BamHI and the smaller (approximately 633 base pairs) Tth 
fragment was gel isolated and purified. The Tth Ndel-BamHI insert was ligated into the 

10 Taq Ndel-BamHI vector as detailed above to generate TaqTth(Bs-B) O^NA sequence 
SEQ ID N0;7S; amino acid sequence SEQ ID N0:8). 



7. Construction of TaqTth(N-Bs) 

The Taq DN RX HT construct was cut with the enzymes NotI and BstBI and the 
1 s larger vector Augment was isolated as detailed above. The Tth DN RX HT construct was 
also cut with NotI and BstBI and the smaller (approximately 162 base pairs) Tth fragment 
was gel isolated and purified. The Tth Notl-BstBI insert was ligated into the Taq 
Notl-BstBI vector as detailed above to generate TaqTth(N-Bs) (DNA sequence SEQ ID 
NO:76; amino acid sequence SEQ ID N0:9). 

20 

8. Construction of TthTaq(B-S) 

The Tth DN RX HT construct was cut with the enzymes BamHI and Sail and the 
larger vector firagmOTt was isolated as detailed above. The Taq DN RX HT construct was 
also cut with BamHI and SaU and the smaller (approximately 741 base pairs) Tth 
25 firagment was gel isolated and purified. The Taq BamHI-Sall insert was ligated into the 
Tth BamHI-Sall vector as detailed above to generate the TthTaq(B-S) (DNA sequence 
SEQ ID NO:77; amino acid sequence SEQ ID NO:10). 

9. Construction of Tth Taq(N-B) 

30 The Tth DN RX HT construct was cut with the enzymes NotI and B amHI and the 

larger vector fiiBgment was isolated as detailed above. The Taq DN RX HT construct was 
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also cut with Nod and BamHI and the smaller (approximately 795 base pairs) Tth 
fiagment was gel isolated and purified. The Taq NoU-Bamffl insert was Ugated into the 
Tth Notl-BamHI vector as detailed above to generate the TthTaq(N-B) (DNA sequence 
SEQ E) NO:78; amino .acid sequaice SEQ E) N0:1 1). 

The cleavage activities of these chimerical proteins were characterized as describe 
in Example I. part A, and a comparison of the cleavage cycling rates on an RNA target is 
shown in Figure 12. As further discussed in the Description of the Invention, these data 
show that elements found in the central third of the TthPol protein are important in 
conferring the TtiiPol-like RNA-dependent cleavage activity on tiie chimerical proteins 
comprising portions of TaqPol. 

EXAMPLE 4 

Alterations influencing RNA-dependent 5' nuclease activity do not necessarily 
influence RNA-dependent DNA polymerase activity 

TthPol is known to have, a more active RNA template dependent DNA 
polymerase than does the TaqPol (Myers and Gelfand, Biochemistiy 30:7661 [1991]). 
To determme whether the RNA template dependent 5' nuclease activity of tiie Thermus 
DNA Pol I enzymes is related to tiieir RNA-dependent polymerase activity, the D785N 
and D787N mutations used to create the polymerase-deficient versions of TaqPol and 
TtiiPol, respectively were reversed. Polymerase activity was similarly restored to tiie 
TaqTtii (N) (DNA sequence SEQ ID NO:79; amino acid sequence SEQ ID N0:12), 
TaqTfli(N-B) (DNA sequence SEQ ID NO:80; amino acid sequence SEQ ID N0:13), 
TaqTtii(B-S) (DNA sequence SEQ ID N0:81; amino acid sequence SEQ ID NO: 14) 
chimeras, and tiie TaqPol(W417L/G4l8K/ E507Q) (DNA sequence SEQ ID NO:82; 
amino acid sequence SEQ ID N0:15) mutant protdns. 

Polymerase ftinction was restored in ail tiie above mentioned enzyme mutants by 
inserting flie BamHI to Sail fragment of flie native, non-DN sequence into the selected 
chimera or mutant enzyme. For example, tiie mutant construct TaqTtii(N-B) was cut 
witii tiie restaiction enzyme BamHI (approximate amino acid position 593) and ttie 
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restriction enzyme Sail (^proximate amino acid position 840). The larger vector 
fragment was gel purified as described in Example 3D. The native TaqPol constmct was 
also cut with the restriction endonucleases BamHI and Sail, and the smaller insert 
fragment containing the native amino acid sequence was also gel purified. The insert 
5 fragment was then ligated into the vector as detailed in Experimental Example 3D. 

The poljonerase activities of these proteins were evaluated by extension of the 
dT25.35-oligonucleotide primer with fluorescein-labeled dUTP in the presence of either 
poly(dA) or poly(A) template. Primer extension reactions were carried out in 10 \il 
buffer containing 10 mM MOPS, pH7.5, 5 mM MgS04, 100 mM KCl. Forty ng of 

10 enzyme were used to extend 10 \iM (dT)25-3o primer in the presence of either 10 |jM 

poly(A)286 or 1 |nM poly(dA)273 template, 45 pM dTTP and 5 jiM Fl-dUTP at 60*^0 for 30 
min. Reactions were stopped with 10 ^1 of stop solution (95% formamide, 10 mM 
EDTA, 0.02% methyl violet dye). Samples (3 fil) were fractionated on a 15% denaturing 
acrylamide gel and the fraction of incorporated Fl-dUTP was quantitated using an 

15 FMBIO-100 fluorescent gel scanner (Hitachi) equipped with a 505 nm filter as described 
above. 

As shown in Figure 16, the DNA-dependent polymerase activities are very similar 
for all constructs used in this experiment, whereas the RNA-dependent polymerase 
activities of TthPol, TaqTth(N) and TaqTth(B-S) are at least 6-fold higher than the 
20 activities of TaqPol, TaqTth(N-B) and the TaqPol W417L/G418K/E507Q mutant. From 
the analysis of these results, it can be concluded that the high RNA-dependent DNA 
polymerase activity of TthPol is determined by the C-terminal half of the polymerase 
domain (roughly, amino acids 593-830) and that the RNA-dependent 5' nuclease and 
polym^ase activities are not related to each other, and are controlled by different regions. 

25 

EXAMPLES 

Specific point mutants in Taq DN RX HT developed from information from tlie 

chimeric studies 

30 The chimeric studies (Example 3, above) suggest that the part of the TthPol 

sequence determining its high RNA-dependent 5' nuclease activity comprises the 
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BstBI-BamHI region located approximately between amino acid 382 and 593. 
Comparison of the amino acid sequences between the BstBI and BamHI regions of Tth 
DN RX HT and Taq DN RX HT (SEQ ID NOS:165 and 164, respectively) revealed only 
25 differences (Figure 13). Among these, 12 amino acid changes were conservative 
while 13 of the differences resulted in a changes in charge. Since the analysis of the 
diimeric enzymes suggested that the critical mutations are located in both the BstBI-Ndel 
and the Ndel-BamHI regions of Tth DN RX HT, site specific mutagenesis was used to 
introduce the Tth DN RX HT specific amino acids into the BstBI-Ndel and Ndel-BamHI 
regions of the TaqTth(D-B) and the TaqTth(N-D) respectively. 

Six Tth DN RX HT specific substitutions were generated in the BstBI-Ndel 
region of the TaqTth(P-B) by single or double amino acid mutagenesis. Similarly, 12 
Tth DN RX HT specific amino acid changes were introduced at the homologous position 
of the Ndel-Bamm region of the TaqTth(N-D). 

Plasmid DNA was purified from 200 ml of JM109 overnight culture using 
QIAGEN Plasmid Maxi Kit (QIAGEN, Chatsworth, CA) according to the manufacturer's 
protocol to obtain enough starting material for aU mutagenesis reactions. All site specific 
mutations were introduced using the Transformer Site Directed mutagenesis Kit 
(Clontech) according to tiie manufacturer's protocol; specific sequence information for 
the mutagenic primers used for each site is provided below. One of two different 
selection primers, Trans OUgo AlwNI/Spel or Switch Oligo Spel/AIwNI (Clontech, Palo 
Alto, CA catalog #6488-1 or catalog #6373-1) was used for all mutagenesis reactions 
described. The selection oligo used in a given reactionds de^jendent on the restriction site 
present in the vector. All mutagenic primers wwe synthesized by standard synthetic 
chemistry. Resultant colonies were E.coli strain JM109. 

1. Construction of TaqTth(D-B) E404H (DNA sequence SEQ ID NO:83; 
amino acid sequence SEQ ID NO:16) 

Site specific mutagenesis was performed on pTrc99A TaqTtb(D-B) DNA using 
the mutagenic primer 240-60-01 5'-gag gag gcg ggg cac egg gcc gcc ctt-3' (SEQ ID 
NO:277) to introduce flie E404H mutation. 
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2. Construction of TaqTth(D-B) F413H/A414R (DNA sequence SEQ ID 
NO:84; amino acid sequence SEQ ID NO:17) 

Site specific mutagenesis was performed on pTrc99A TaqTai(D-B) DNA using 
the mutagenic primer 240-60-02 S'-ctt tec gag agg etc cat egg aac ctg tgg ggg agg-3' 
5 (SEQ ID NO:278) to introduce the F413H and the A414R mutations. 

3. Construction of TaqTthp*B) W417L/G418K (DNA sequence S£Q ID 
NO:85; amino acid sequence SEQ ID NO:18) 

Site specific mutagenesis was performed on pTrc99A TaqTfli(D-B) DNA using 
10 the mutagenic primer 240-60-03 5'-ctc ttc gcc aac ctg ctt aag agg ctt gag ggg gag-3' 
(SEQ ID NO:279) to introduce the W417L and the G41 8K mutations. 

4. Construction of TaqTth(D-B) A439R (DNA sequence SEQ ID NO:86; 
amino acid sequence SEQ ID NO:19) 

15 Site specific mutagenesis was performed on pTrc99A TaqTthO^D-B) DNA using 

the mutagenic primer 240-60-04 5'-agg ccc ctt tec egg gtc ctg gcc cat-3' (SEQ ID 
NO:280) to introduce the A439R mutation. 

5. Construction of TaqTth(N-D) L451R (DNA sequence SEQ ID NO:87; 
20 amino acid sequence SEQ ID NO:20) 

Site specific mutagenesis was performed on pTrc99AtaqTth(N-D) DNA using the 
mutagenic primer 240-60-05 5'-acg ggg gtg cgc egg gac gtg gcc tat-3' (SEQ ID NO:281) 
to introduce the L41S mutation. 

25 6. Construction of TaqTth(N-D) R457Q (DNA sequence SEQ ID NO:88; 

amino acid sequence SEQ ID NO:21) 
Site specific mutagenesis was performed on pTrc99AtaqTfli(N-D) DNA using the 
mutagenic primer 240-60-06 5 '-gtg gcc tat etc cag gcc ttg tec ctg-3' (SEQ ID NO:282) to 
introduce the L41 5Q mutation. 

30 
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7. Construction of TaqTth(N-D) V463L (DNA sequence SEQ D) NO:89; 
amino acid sequence SEQ D) NO:22) 

Site specific mutagenesis was perfonned on pTrc99AtaqTth(N-D) DNA using tbe 
mutagenic primer 240-60-07 S'-ttg tec ctg gag ctt gcc gag gag atc-3' (SEQ ID NO:283) 
to introduce the V463L mutation. 

8. Construction of TaqTth(N-D) A468R (DNA sequence SEQ ID NO:90; 
amino acid sequence SEQ ID NO:23) 

Site specific mutagenesis was perfonned on pTrc99AtaqTth(N-D) DNA using the 
mutagenic primer 240-60-08 5'-gcc gag gag ate cgc cgc etc gag gcc-3' (SEQ ID NO:284) 
to introduce the A468R mutation. 

9. Construction of TaqTth(N-D) A472E (DNA sequence SEQ ID NO:91; 
amino acid sequence SEQ ID NO:24) 

Site specific mutagenesis was perfonned on pTrc99AtaqTtb(N-D) DNA using the 
mutagenic primer 240-60-09 5'-gcc cgc etc gag gag gag gtc ttc cgc-3' (SEQ ID NO:285) 
to introduce tbe A472E mutation. 

10. Construction of TaqTth(N-D) G499R (DNA sequence SEQ ID NO:92; 
amino acid sequence SEQ ID NO:25) 

Site specific mutagenesis was perfonned on pTrc99AtaqTth(N-D) DNA using the 
mutagenic primer 240-60-10 5'-ttt gac gag eta agg ctt ccc gcc atc-3' (SEQ ID NO:286) to 
introduce the G499R mutation. 

n. Construction of TaqTth(N-D) E507Q (DNA sequence SEQ D) NO:93; 
amino acid sequence SEQ ID NO:26) 

Site specific mutagenesis was perfonned on pTrc99AtaqTth(N-D) DNA using the 
mutagenic primer 276-046-04 5'-atc gcc aag acg caa aag acc ggc aag-3' (SEQ ID 
NO:287) to introduce the E507Q mutation. 
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12. Construction of TaqTth(N-D) YS35H (DNA sequence SEQ ID NO:94; 
amino acid sequence SEQ ID NO:27) 

Site specific mutagenesis was perfonned on pTrc99 AtaqTth(N-D) DNA using the 
mutagenic primer 240-60-1 1 5'-aag ate ctg cag cac egg gag etc acc-3' (SEQ ID NO:288) 
S to introduce the YS3SH mutation. 

13. Construction of TaqTth(N-D) S543N (DNA sequence SEQ ID NO:95; 
amino acid sequence SEQ ID NO:28) 

Site specific mutagenesis was performed on pTrc99AtaqTth(N[-D) DNA using the 
10 mutagenic primer 240-60-12 5'-acc aag ctg aag aac acc tac att gac-3' (SEQ ID NO:289) 
to introduce the SS43N mutation. 

14. Construction of TaqTth(N-D) I546V (DNA sequence SEQ ID NO:96; 
amino acid sequence SEQ ID NO:29) 

15 Site specific mutagenesis was performed on pTrc99AtaqTth(N-D) DNA using the 

mutagenic primer 240-60-13 5'-aag age acc tac gtg gac ccc ttg ccg-3' (SEQ ID NO:290) 
to introduce the IS46V mutation. 

15. Construction of TaqTth(N-D) DS51S/I553V (DNA sequence SEQ ID 
20 NO:97; amino add sequence SEQ ID NO:30) 

Site specific mutagenesis was performed on pTrc99 AtaqTfli(N-D) DNA using the 
mutagenic primer 240-60-14 5'-att gac ccc ttg ccg age etc gtc cac ccc agg acg ggc-3' 
(SEQ ID NO:291) to mtroduce the D551S and the I553V mutations. 

25 16, Construction of TaqDN RX HT W417L/G418K/E507Q (DNA 

sequence SEQ ID NO:98; amino acid sequence SEQ ID NO:31) 

The TaqDN RX HT W417L/G418K/E507Q triple mutant was made by 
combining the TaqTth(D-B)W417L/G418K with the TaqTth(N-D) E507Q, 
TaqTthp-B)W417L/G418K was cut with the restriction enzymes Ndel and BamHI, and 
30 the larger, vector firagment was isolated as detailed in Example 3. The TaqTth(N-D) 
ES07Q construct was also cut with Ndel and BamHI and the smaller (approximately 795 
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base pairs) fragment was gel isolated and purified as detailed in Example 3. The 
Ndel-BamHI insert was ligated into the gel purified vector, as detaUed in Example 3. 

17. Construction of TaqDN RX HT W417L/E507Q (DNA sequence SEQ 
ID NO:99; amino acid sequence SEQ ID NO:32) 

Starting with TaqDN RX HT W417L/G418K/E507Q described above, mutagenic 
primer 337-01-02: 5'-TTC GCC AAC CTG CTT GGG AGG CTT GAG GGG GAG -3' 
(SEQ ID NO:292) was used in a site specific mutagenesis reaction to change the K at 
amino acid position 418 back to the wild-type amino acid, G. Site specific mutagenesis 
was done using the Ti^msformer Site Directed Mutagenesis Kit (Clonetech) according to 
the manufecturer's instructions, and as described in Experimental Example 4. 

18. Constractloii of TaqDN RX HT G418K/E507Q (DNA sequence SEQ 
ID NO:100; amino acid sequence SEQ ID NO:33) 

Starting with TaqDN RX HT W417L/G41 8K/E507Q described above, mutagenic 
primer 337-01-01: 5'-CTC TTC GCC AAC CTG TGG AAG AGG CTT GAG GGG -3' 
(SEQ ID NO:293) was used in a site specific mutagenesis reaction to change the L at 
amino acid position 417 back to Ihe wild-type amino acid, W. Site specific mutagenesis 
was done using the Transfonner Site Directed Mutagenesis Kit (Clonetech) according to 
Ihe manufecturer's instructions, and as described in Bq)erimental Example 4. 

Expression and purification of mutant proteins was done as detailed in Example 3, 
and the cleavage activities of these proteins were characterized as describe in Example 1, 
part A. A comparison of the cleavage cycling rates of a selection of these mutant 
proteins on an RNA target is shown in Figure 14. As further discussed in the Description 
of the Invention, these data show that amino acids in the regions 417/418 and amino acid 
507 are important in the conferring the TthPol-like KNA-dependent cleavage activity on 
the chimerical proteins comprising portions of TaqPol in combination witii portions of 
TthPol that are not indq)endently enable of providing enhanced RNA dependent activity 
(i.e., the D-B and N-D portions of Tth). As described in tiie Description of the Invention, 
Taq DN RX HT variant carrying only the W417L, G418K and E507Q substitutions were 
created. By comparing tiieir cleavage rates to that of Ttii DN RX HT on the IL-6 RNA 
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substrate as described in Example 1, these mutations were detemiined to be sufiBcient to 
increase the Taq DN RX HT activity to the Tth DN RX HT level. Figure 15 shows that 
the Taq DN RX FT W417L/G418K/E507Q and Taq DN RX HT G418K/E507Q mutants 
have 1 .4 times higher activity than Tth DN RX HT and more than 4 fold higher activity 
5 than Taq DN RX HT, whereas the Taq DN RX HT W417L/E507Q mutant has the same 
activity as the enzyme, which is about 3 fold higher than Taq DN RX HT, These results 
demonstrate that K41 8 and Q507 of TthPol are particularly important amino acids in 
providing RNA dq>endent 5' nuclease activity that is enhanced compared to TaqPol. 



10 EXAMPLE 6 

RNA-dependent 5* nuclease properties of the Taq DN RX HT G418K/ES07Q 5' 
nuclease are similar to Ttb DN RX HT with respect to salt and temperature optima 



To determine if the G418K/E507Q mutations caused any significant changes in 

15 &e properties of the Taq DN RX HT mutant in addition to the increased cleavage rate 
with the RNA target, the Taq DN RX HT G418K/E5d7Q (SEQ ID NO:33), Taq DN RX 
HT (SEQ E) N0276), and Tth DN RX HT (SEQ ID NO:274) enzymes were compared in 
the RNA template depend^t 5' nuclease assay under conditions where temperature and 
the concentrations of salt and divalent ions were varied. The iq)stream DNA and the 

20 template RNA strands of the substrate used in this study were linked into a single IrT 
molecule (SEQ ID NO: 166) as shown in Figure 20A, and the labeled downstream probe 
(SEQ ID NO: 167) was present in large excess. The 5' end of the target RNA strand was 
blocked with a biotin-streptavidin complex to prevent any non-specific degradation by 
' the enzyme during the reaction (Lyamichev et al. Science 260:778 [1993], Johnson et 

25 al. Science 269:238 [1 995]). The cleavage rates for Taq DN RX HT G41 8K/E507Q, 
Taq DN RX HT, and Tth DN RX HT are plotted as functions of temperature in Figure 
20B. The closed circles represent enzyme Taq DN RX HT, the open circles represent 
enzyme Tth DN RX HT, and the Xs represent enzyme Taq DN RX HT G418K/E507Q. 
The difference in the activities of Tth and Taq DN RX HT enzymes with the IrT substrate 

30 is even greater than the difference found witti the IL-6 RNA substrate when tested in a 
cleavage assay as described in Example 1. The G418K/E507Q mutations increase the 
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activity of the Taq enzyme more than tenfold and by 25% compared with the Tth 
emyme. All three enzymes show a typical temperature profile of the invasive signal 
amplification reaction and have the same optimal temperature. No significant effect of 
G418K/E507Q mutations on DNA dependent 5* nuclease activity of Taq DN RX HT with 
5 the all-DNA substrate analogous to IrT (SEQ ID NO: 168) under the same conditions was 
found. 

The effects of KCl and MgS04 concentrations on the 5* nuclease activity of Taq 
DN RX HT G418K/E507Q, Taq DN RX HT, and Tth' DN RX HT with the IrT substrate 
are shown in Figure 20C and D. The activities of all enzymes have similar salt 

10 dependencies with an optimal KCl concentration of 100 mM for Taq DN RX HT 
G418K/E507Q and Tth DN RX HT and 50 mM for Taq DN RX HT. The optimal 
MgS04 concentration for all enzymes is approximately 8 mM. The analysis of the data 
presented in Figure 20 suggests that the properties of Taq DN RX HT 041 8KyE507Q are 
much closer to those of Tth DN RX HT rather than Taq DN RX HT confirming the key 

15 role of the G418K/E507Q mutations in the recognition of the substrate with an RNA 
target. 

To understand the mechanism of the reduction of the 5* nuclease activity in the 
presence of an RNA versus a DNA target, the Michaelis constant (K^ and the maximal 
catalytic rate (kcat) of all fliree enzymes were detemiined, using an excess of the IrT 

20 substrate (SEQ ID NO:166) and the downstream probe (SEQ ID NO:167) and a limiting 
enzyme concentration. For these measurements, ten-jal reactions were assembled 
containing 10 mM MOPS, pH 7.5, 0.05% Tween 20, 0.05% Nonidet P-40, 10 ^g/ml 
tRNA, 4 mM MgCh, 1 nM of enzyme (Taq DN RX HT, Tth DN RX HT, or Taq DN RX 
HT G418K/E507Q) and different concentrations (0.125, 0.25, 0.5 or 1 jiM) of an 

25 equimolar mixture of the IiT target and the downstream probe. The cleavage kinetics for 
each enzyme and each substrate concentration were measured at 46°C. Reactions were 
stopped by the addition of 10 jxl of 95% formamide containing 10 mM EDTA and 0.02% 
methyl violet (Sigma). One |a1 of each stopped reaction digest was fractionated on a 20% 
denaturing acrylamide gel (19:1 cross-linked), with 7M urea, and in a buffer of 45 mM 

30 Tris-borate, pH 8.3, 1 .4mM EDTA Gels were scanned on an FMBIO-100 fluorescent 
gel scanner (Hitachi) using a 585 nm filter. The fraction of cleaved product (detemiined 
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from intensities of bands corresponding to uncut and cut substrate with FMBIO Analysis 
software, version 6.0, Hitachi) was plotted as a function of reaction time. The initial 
cleavage rates were detranined from the slopes of linear part of the cleavage kinetics and 
were defined as the concmtration of cut product divided by the enzyme concentration 
5 and the time of the reaction (in minutes). The Michaelis constant Km and the maximal 
catalytic rate Abat of each enzyme with IrT substrate were detennined from the plots of the 
initial cleavage rate as functions of the substrate concentration. 

It was found that all three enzymes have similar values (in the range of 
200-300 nM) and kcat values of approximately 4 min"^ for Taq DN RX HT and Tth DN 
10 RX HT and of 9 min ^ for Taq DN RX HT G418K/E507Q. That the G418K/E507Q 

mutations increase ttie kcat of Taq DN RX HT more than two fold, but have little effect on 
Km suggest that die mutations position the substrate in an orientation more appropriate for 
cleavage, rather than singly increase the binding constant 

IS EXAMPLE 7 

Use of molecular modeiing to further improve RNA-dependent 5' nuclease activity 

A. Point mutants 

To develop enzymes with altered function, sequence changes wm introduced by 
20 site-specific mutagenesis in predetennined locations or by random mutagenesis. 

Locations for site-specific mutagenesis were chosen based on evidence from chimeric 
studies, relevant published literature, and molecular modeling. Seven additional mutant 
enzymes were developed from the Tth DN RX HT enzyme, and twenty additional mutant 
enzymes were developed from the Taq DN RX HT enzyme, both discussed previously. 
25 Some of the mutant enzymes are the result of multiple mutagenesis reactions, that is, 

more than one change has been introduced to obtain the final product. Mutation reactions 
were done using the Tth DN RX HT construct (SEQ ID NO:273) described m Example 
2C2, or flie Taq DN RX HT construct (SEQ ID NO:275), described in Example 2C1 
unless otherwise stated. Plasmid DNA was purified from 200 ml of JM109 overnight 
30 culture using QIAGEN Plasmid Maxi Kit (QIAGEN, Oiatsworth, CA) according to the 
manufacturer's protocol to obtain enough starting material for all mutagenesis reactions. 
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All site-specific mutations were introduced using the Transformer Site Directed 
mutagenesis Kit (Clontech) according to the manufacturer's protocol. One of two 
different selection primers, Trans OUgo AlwNI/Spel or Switch OUgo Spel/AlwNI 
(Clontech, Palo Alto CA catalog #6488-1 or catalog #6373-1) was used for all 
mutagenesis reactions described. The selection oligo used in a given reaction is 
dependent on the restriction site present m the vector. All mutagenic primers for both the 
site-specific mutagenesis and the random mutagenesis were synthesized by standard 
synthetic chemistry. Resultant colonies for both types of reactions were Kcoli strain 
JM109. Random mutagenesis methods are described below. 

Mutants were tested via the rapid screening protocol detailed in Example 1 . 
Then, if more detailed analysis was desired, or if a larger protein prq)aration was 
required, expression and purification of mutant proteins was done as detailed m Example 
3. 

1. Construction of Tfh DN RX HT H641A, Tth DN RX HT H748A, Tth 
DNRXHTH786A 

Site specific mutagenesis was performed on pTrc99A Tth DN RX HT DNA using 
the mutagenic primer 583-001-02: 5'-gct tgc ggt ctg ggt ggc gat gtc ctt ccc ctc-3' (SBQ 
ID NO:294) to introduce the H641A mutation (DNA sequence SEQ ID NO: 101; amino 
acid sequence SEQ ID NO:34), or the mutagenic primer 583-001-03: 5' cat gtt gaa ggc 
cat ggc etc cgc ggc etc cct-3' (SEQ ID NO:295) to generate the H748A mutant (DNA 
sequence SEQ ID N0:102; amino acid sequence SEQ ID NO:35), or the mutagenic 
prima 583-001-04: 5'-cag gag gag etc gtt ggc gac ctg gag gag-3' (SEQ ID NO:296) to 
generate the H786Amutant enzyme (DNA sequence SEQ ID NO:103; amino acid 
sequence SEQ ID NO:36). 

2. Construction of Tth DN RX HT (H786A/G506K/Q509K) 
Starting with the mutant Tth DN RX HT H786A, generated above, site specific 

mutagenesis was done using the mutagenic primer 604-022-02: 5'-gga gcg ctt gcc tgt ctt 
ctt cgt ctt ctt caa ggc ggg agg cct-3' (SEQ ID NO:297) to generate this variant termed 
"TthAKK", (DNA sequence SEQ ID NO:104; amino acid sequence SEQ ID NO:37). 

3. Construction of Taq DN RX HT (W417L/G418KyE507Q/H784A) 
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Mutagenic oligonucleotide 158-029-02: 5 '-gag gac cag etc gtt ggc gac ctg aag gag 
cat-3' (SEQ ID NO:298) was used in a site specific mutagenesis reaction to introduce the 
H784A mutation and generate this construct termed *Taq4M*' (DNA sequence SEQ ID 
NO: 105; amino acid sequence SEQ ID NO:38). 
5 4. Construction of Taq4M H639A, Taq4M R587A, Taq4M G504K and 

Taq4MG80E 

Site specific mutagenesis was done on the Taq4M mutant, using primer 
473-010-11: 5'-gaggggcgggacatcgccacggagaccgccagc-3' (SEQ ID NO:299) to generate 
the Taq 4M H639A mutant (DNA sequence SEQ ID NO: 106; amino acid sequence SEQ 

10 ID NO:39), primer 473-010-10: 5*-cag aac ate ccc gtc gcc acc ccg ctt ggg cag-3' (SEQ 
ID NO:300) to generate Taq 4M R587A (DNA sequence SEQ ID NO:107; amino acid 
sequence SEQ ID NO:40), primer 300-081-06: 5'-ggg ctt ccc gcc ate aag aag acg gag aag 
acc-3' (SEQ ID NO:301) to generate Taq 4M G504K (DNA sequence SEQ ID NO:108; 
amino acid sequence SEQ ID N0:41), and primer 330-088-04: 5 '-eta ggg ctt ccc gcc ate 

15 aag aag acg caa aag acc ggc-3' (SEQ ID NO:302) to generate the Taq 4M G80E mutant 
(DNA sequence SEQ ID NO:109; amino acid sequence SEQ ID NO:42). 

5. Construction of Taq 4M P88E/P90E and Taq 4M L109F/A110T 
Starting with Taq 4M described above, site specific mutagraesis was done using 

primer 473-087-03: S'-ccg ggg aaa gtc etc etc cgt etc ggc ccg gcc cgc ctt-3' (SEQ ID 
20 NO:303) to generate the P88E/P90E mutations (DNA sequence SEQ ID NO:l 10; amino 
acid sequence SEQ ID NO:43), or primer 473-087-05: 5'-cgg gac etc gag gcg cgt gaa ccc 
cag gag gtc cac-3' (SEQ ID NO:304) to generate the L109F/A110T mutations (DNA 
sequence SEQ ID NO: 111; amino acid sequence SEQ ID NO:44). 

6. Constmction of Taq DN RX HT 

25 (W417L/G418K/G499R/AS02K/I503L /GS04K/ E507K/H784A) 

Two PGR reactions were performed, first using construct Taq4M (Taq 
W417L/G418K/G504K/E507Q/H784A) as a template. Using primers 158-84-01 
5'-CTCCTCCACGAGTTCGGC-3' (SEQ ID NO:305) and 535-33-02 5*-ACC GGT 
CTT CTT CGT CTT CTT CAA CTT GGG AAG CCT GAG CTC GTC AAA-3* (SEQ 

30 ID NO:306) a 620 base pair PCR fiiagment was generated. Another 510 base pair PCR 
product was generated using primer 535-33-01 5'-AAG ACG AAG AAG ACC GGT 
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AAG CGC TCC ACC AGC-3' (SEQ ID NO:307) and 330-06-03 5'-GTC GAC TCT 
AGA TCA GTG GTG GTG GTG GTG GTG CTT GGC CGC CCG GCG CAT C-3' 
(SEQ E) NO:308). The two PGR products overlap such that a final recombinant PGR 
ampUfication was done using the outside primers 158-84-01 and 330-06-03 to yield the 
1 182 base pair product. The recombinant PGR product was digested with the restriction 
enzymes NotI and BamHI according to the manufacturer's instructions to yield a 793 
base pair fi-agment The parent plasmid Taq4M was also digested with the same enzymes 
and used as the vector for ligation. All DNA fragments were TAB agarose gel purified 
prior to ligation. The fi^gment was ligated into the vector, and transfonned into JM109 
cells, thus incorporating the mutations G499R, A502K, I503L, and E507K as well as the 
restriction endonuclease site, Agel. This construct is tenned 'Taq SNT' (DNA sequence 
SEQ ID N0:1 12; amino acid sequmce SEQ ID NO:45). 

B« Random Mutagenesis 

Numerous enzymes with altered function were generated via random 
mutagenesis. The regions of the protein targeted for random mutagenesis were chosen 
based on molecular modeling data and from information in the Uterature. Different 
mutagenic primers were used to introduce mutations into different regions of the protein. 
Random mutagenesis was performed on the Taq variant Taq 4M G504K (Taq DN RX 
HT W417L/G418K/G504K/E507Q/H784Ay) (SEQ ID NO:108) described above and 
mutant PGR fragments generated in the mutagenesis reaction were exchanged for 
homologous regions in TaqSM (SEQ ID NO: 1 12) unless otherwise stated. 

Random mutagenesis was also performed on the Tth DN RX HT H786A (SEQ ID 
NO: 103) described above. Mutant PGR firagments generated with the Tth DN RX HT 
H786A template were exchanged for homologous regions in the unaltered Tth DN RX 
HTH786A. 

1. Random mutants in amino acid residues 500-507 or 513-520 
The fust mutagenic oligonucleotide, 535-054-01 : 5'-gga gcg ctt acc ggt ctt (ttg 
cgt ctt ctt gat ctt ggg aag) cct tag etc gtc aaa gag-3' (SEQ ID NO:309) was used in 
conjunction with 158-84-01: 5'-CTC CTC CAC GAG TTC GGC-3' (SEQ ID NO:310) to 
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install random residues from amino acid position 500 to 507 of Taq polymerase variant 
Taq DN RX HT W417L/G418K/G504K/E507Q/H784A (SEQ ID NO:108). This was 
accomplished by synthesizing the primer 535-054-01 such that only 91% of the bases 
within the parenthesis are unaltered while the remaining 9% of the bases are an equal 

5 mixture of the other 3 nucleotides. The initial, unaltered sequence of this oligo mcludes 
the G499R, A502K and the Q507K changes. 

To generate mutations in the region 500-507, primer 535-054-01 and primer 
158-84-01 were used in a PGR reaction, using the Advantage cDNA PGR kit (Clonetech) 
and Taq variant described above, as the target. This PGR fragments was then run on a 

10 1% TAE agarose gel, excised and purified with QIAquick Gel Extraction Kit (Qiagen, 
Valencia CA, catalog # 28706). The purified fragment was cut with NotI and Agel and 
ligated into pTaqSM that had been linearized with NotI and Agel. JM109 E.coli cells 
(Promega) were transformed with the ligated products. Glones were tested as described 
below. 

15 The second mutagenic oligonucleotide (used in a separate reaction) 535-054-02: 

5'-caa aag acc ggt aag cgc (tec acc age gcc gcc gtc ctg gag) gcc etc cgc gag gcc cac-3' 
(SEQ ID N0:31 1) was used m conjunction with 330-06-03: 5'-GTC GAG TGT AGA 
TCA GTG GTG GTG GTG GTG GTG CTT GGC CGC CCG GCG CAT C-3 * (SEQ ID 
NO:312) to install random residues fix)m amino acid 513-520. The bases within the 

20 parenthesis of primer 535-054-02 are also 91% wild-type and 3% each of the other 3 
nucleotides. 

To generate mutations in the region 513-520, primer 535-054-02 and primer 
535-054-02 were used in a PGR reaction with Taq DN RX HT W417L/G418K/G504K/ 
E507Q/H784A (SEQ ID NO:108) as template, as described above. The resulting PGR 
25 fragment was purified as above and cut with the restriction enzymes Agel and BamHI. 
The cut fragment was then ligated into the Taq8M construct, also linearized with Agel 
and BamHI. JMl 09 £.co// cells were transformed with the ligated products. Glones 
were tested as described Example 1 . Mutants developed fix>m these include: 

30 Taq DN RX HT W417L/G418K/G499R/A502K/K504N/E507K/H784A (Ml-13) (DNA 
sequence SEQ ID N0:1 13; amino acid sequence SEQ ID NO:46). 
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Taq DN RX HT W417UG418K/G499R/L500I/A502K/G5O4K/Q5O7Hm784A (Ml-36) 
(DNA sequence SEQ JD NO: 114; amino acid sequence SEQ ID NO:47). 
Taq DN RX HT W417LyG418K/G499RyA502K/I503UG504K;E507KyT5 14S/H784A> 
(NC-24) (DNA sequence SEQ ID NO: 1 15; amino acid sequence SEQ ID NO:48). 
Taq DN RX HT W417L/G418K/G499R/A502K/I503L/G504K/B507K/ V518L/H784A 
(M2-06) (DNA sequence SEQ ID NO: 1 16; amino acid sequence SEQ ID NO:49). 
2. TthDNRXHTH786A random mutagenesis 
To generate mutants in the helix-hairpin-helix region of the TthDN RX HT 
H786A (SEQ ID NO:36) enzyme, two different PGR reactions were performed using the 
H786A (SEQ ID NO:103) mutant as a template. The two PGR products overlap such that 
a recombinant PGR reaction can be performed (Higuchi, in PGR Technology, H. A. 
Erlich, ed., Stockton Press, New York. pp61-70 [1989]). This final PGR product is then 
exchanged with the homologous region of the TthDN H786A mutant by using restriction 
enzyme sites located on the ends of the fragment and within the TthDN H786A sequence. 

Starting with TthDN H786A discussed above, and using primer 604-08-06: 5'-gtc 
gga ggg gtc ccc cac gag-3' (SEQ ID N0:31 3) and primer 390-76-08: 5'-tgt gga att gtg 
age gg (SEQ ID NO:314), a 620 base pair PGR fragment was generated. PGR reactions 
were performed using the Advantage cDNA PGR kit (Clontech) according to 
manufecturer's instructions. This PGR product includes amino acids 1-194. No 
mutations were introduced via this reaction, however the restriction enzyme site EcoRI is 

pres^attheS' end. 

Starting with TthDN RX HT H786A discussed above, and usmg mutagenic 
primer 604-08-05: 5'-cto gtg ggg gac ccc tec gac aac etc (ccc ggg gtc aag ggc ate ggg 
gag aag acc gcc) etc aag ctt etc aag-3' (SEQ ID NO:315) and primer 209-74-02: 5'-gtg 
gcc tec ata tgg gcc agg ac-3* (SEQ ID NO:316) a 787 base pair PGR fragment was 
genwated. PGR reactions were done as above. This fragment does contain random 
mutations, due to the presence of the mutagenic primer, 604-08-05. The bases within the 
parenthesis of this primer were synthesized such that 91% of the sequence is wild-type, 
while the additional 9% is evenly divided between the remaining 3 bases. 

nie two PGR fragments overlap, and were combined in a recombinant PGR 
reaction. Primers 390-76-08 and 209-74-02 were added, and the Advantage cDNA PGR 
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kit (Clontech) was again used according to manufacturer's instructions. A 1380 base pair 
product was generated from this reaction. 

The recombinant PCR product was cut with the restriction enzymes EcoRI and 
NotI according to the manufacturer's instructions to yield a 986 base pair fragment. 
5 TthDN RX HT H786A was prepared by cutting with the same enzymes. The fragmmt 
was then ligated into the vector, and transformed into JM109 cells. New mutants 
developed from this set of reactions include: 

TthDN RX HT H786A/PI97R/K200R (DNA sequence SEQ ID N0:117; amino add 
10 sequence SEQ ID NO:50). 

TthDN RX HT H786A/K205 Y (DNA sequence SEQ ID NO: 118; amino acid sequence 
SEQIDN0:51). 

TthDN RX HT H786A/G203R (DNA sequence SEQ ID N0:1 19; amino acid sequmce 
SEQIDNO:52). 

15 3. Construction of Taq DN RX BTT W417L/G418K/H784A 

L109F/A110T/G499R/AS02K/I503L/G504K/E507Knr514S (Taq SS) 
Starting with Taq DN RX HT 
W417L/G4I8K/G499R/A502K/I503L/G504K/E507K/T514S/H784A (SEQ ID N0:115) 
mutant described above, primer 473-087-05: S'-cgg gac etc gag gcg cgt gaa ccc cag gag 
20 gtc cac-3 * (SEQ ID N0:3 1 7) was used in conjunction with the appropriate selection 
primer in a site specific mutagenesis reaction to incoiporate tiie LI 09F and Al lOT 
mutations to generate this enzyme, termed 'TaqSS" (DNA sequence SEQ ID NO:120; 
amino acid sequence SEQ ID NO:53). 

4. Construction of Taq DN RX HT W417L/G418L/H784A 
25 P88E/P90E/G499R/A502K/IS03L/G504K/E507K/T514S 
Starting with Taq DN RX HT 
W417I7G418KyG499R/A502Kyi503I7G504K/E507K/T514S^^ (SEQ ID N0:1 15) 
mutant described above, primer 473-087-03: 5*-ccg ggg aaa gtc etc etc cgt etc ggc ccg 
gcc cgc ctt-3' (SEQ ID N0:3 18) was used in conjunction with the appropriate selection 
30 primer in a site specific mutagenesis reaction to incorporate the P88E and P90E 
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mutations to generate this enzyme (DNA sequence SEQ ID N0:121; amino acid 
sequence SEQ ID NO:54). 

5. TaqSS random mutagenesis 

Random mutagenesis was used to introduce additional changes in the 
heUx-hairpin-helix domain of the TaqSS mutant (SEQ ID NO:120). The mutagenesis 
was done as described in example 9 above. In the first step, two different but overlapping 
PGR products were generated. One of the PGR products, generated with oUgos 
390-76-08 (SEQ ID NO:314), and 604-08-04: 5'-gtc gga etc gtc acc ggt cag ggc-3' (SEQ 
ID N0:3 19) incorporates the EcoRI site into the fragment, but does not incorporate any 
mutations. The second PGR product utiUzes mutagenic primer 604-08-03: 5'-ctg acc ggt 
gac gag tec gac aac ctt (ccc ggg gtc aag ggc ate ggg gag aag acg gcg) agg aag ctt ctg 
gag-3' (SEQ ID NO:320) and primer 209-74-02 (SEQ ID NO:316). This firagment 
contains random point mutations, and when combined via recombinant PGR with the first 
fragment, can be cut with the restriction enzymes EcoRI and Nott, and ligated into the 
TaqSS construct, also cut with EcoRI and Notl. The ligated construct was then 
transformed mto JM109. Golonies were screened as described below. Enzymes 
developed from this mutagenesis include: 

TaqSS K198N (DNA sequence SEQ ID N0:1 12; amino acid sequence SEQ ID NO:55). 
TaqSS A205Q (DNA sequence SEQ ID NO: 123; amino acid sequence SEQ ID NO:56). 
TaqSS I200M/A205G (DNA sequence SEQ ID NO;124; amino acid sequence SEQ ID 
NO:57). 

TaqSS K203N (DNA sequence SEQ ID NO: 125; amino acid sequence SEQ ID NO:58). 
TaqSS T204P CDNA sequence SEQ ID NO:126; amino acid sequence SEQ ID NO:59). 
6. Construction of TaqSS R677A 

To genwate razymes with sequence changes in both the arch region and in the 
polymerase region, additional specific point mutations were generated in TaqSS. Site 
specific mutagenesis was performed as described above using the oUgo 473-060-10: 
5'-tag etc ctg gga gag ggc gtg ggc cga cat gcc-3' (SEQ ID NO:321) to generate the 
TaqSS R677A mutant (DNA sequence SEQ ID NO: 127; amino acid sequence SEQ ID 
NO:60). 
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7. Construction of TaqTthAKK (DNA sequence SEQ ID NO:128; amino 
acid sequence SEQ ID NO:61) and TthTaqSM (DNA sequence SEQ ID NO:129; 
amino acid sequence SEQ ED NO:62) 

Chimeric mutant TaqTthAKK and TthTaqSM were generated by cutting Tth DN 
5 RX HT (H786A/G506K/Q509K) (SEQ ID NO:104; here abbreviated TthAKK) or Taq 
4M G504 (SEQ E) NO:108; here abbreviated Taq 5M) with the restriction endonucleases 
EcoRI and Notl. The smaller insert fragments as well as the larger vector fragments were 
gel purified as detailed in Example 3D, and the insert fragments were exchanged between 
the two mutants and ligated as described in Example 3D. Scre^iing and verification of 
10 the construct sequence was also done as in Example 3D. 

EXAMPLES 

Improvement of RNA-dependent 5' nuclease activity in other polymerases 

15 filformation gained from the TaqPoI/TthPol recombinations, mutagenesis and 

modelmg, was used to make comparable mutations in additional DNA polym«:ases and 
examined the effects on the cleavage activities of these enzymes. The DNA polymerases 
of Thennus filiformus (TfiPol) and Thermus scotoductus (TscPol) were cloned and 
purified as described in Example 2. The mutagenesis of these two proteins is described 

20 below. 

A. Construction of TfiPolDN2M 

Mutagenesis of pTrc99a-TfiPol (SEQ ID NO:249) was done using the 
QuikChange site-directed mutagenesis kit (Stratagene) according to the manufacturer's 
protocol. The P420K mutation was made with the following two oligonucleotides; 

25 5'-CTTCCAGAACCTCTTTAAACGGCTTTCCGAGAAG (SEQ ID NO:322) and 
5'-CTTCTCGGAAAGCCGTTTAAAGAGGTTCTGGAAG(SEQIDNO:323), The 
E507Q mutation was made with the following two oligonucleotides; 
5'-CCGGTGGGCCGGACGCAGAAGACGGGCAAGC (SEQ ID NO:324) and 
5'-GCTTGCCCGTCTTCTGCGTCCGGCCCACCGG (SEQ ID NO:325). The D785N 

30 mutation was made with the following two oligonucleotides; 

5*-CTCCTCCAAGTGCACAACGAGCTGGTCCTGG (SEQ ID NO:326) and 
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5'.CCAGGACCAC5CTCGTTGTGCACTTGGAGGAG (SEQ ID NO:327). The plasmid 
containing all three mutations is caUed pTrc99a-TfiPolDN2M, (DNA sequence SEQ ID 
NO:130; amino acid sequence SEQ ID NO:63). 
B. Constractioii of TscPolDN2M 

Mutagenesis of pTrc99a-TscPol (SEQ ID NO:253) was done with the 
QuikChange site-directed mutagenesis kit (Stratagene) according to the manufacturer's 
protocol. The E416K mutation was made with the foUowing two oligonucleotides; 
5'-GCCGCCCTCCTGAAGCGGCTTAAGGG (SEQ ID NO:328) and 
S'-CCCTTAAGCCGCTTCAGGAGGGCGGC (SEQ ID NO:329). The E505Q mutation 
was made with the following two oligonucleotides; 
5'-ATCGGCAAGACGCAGAAGACGGGCAAGC (SEQ ID NO:330) and 
y-GCTTGCCCGTCTTCTGCGTCTTGCCGAT (SEQ ID NO:331). The D783N 
mutation was made witii the following two oligonucleotides; 
5'-TTGCAGGTGCACAACGAACTGGTCCTC (SEQ ID NO:332) and 
5'-GAGGACCAGTTCGTTGTGCACCTGCAA (SEQ ID NO:333). The plasmid 
containing aU three mutations is called pTrc99a-TscPolDN2M, (DNA sequence SEQ ID 
NO; 131; amino acid sequence SEQ ID NO:64). 
C. Chimerics of Tsc, Tfl, Tth and Taq mutants 

1. Construction of TliTth AKK (DNA sequence SEQ ID NO:132; amino 
add sequence SEQ ID NO:65), TscTthAKK (DNA sequence SEQ ID 
NO:133; amino acid sequence SEQ ID NO:66), TffTaqSM (DNA 
sequence SEQ ID NO:134; amino acid sequence SEQ ID NO:67) and 
TscTaqSM (DNA sequence SEQ ID NO;135; amino acid sequence 
S£QIDNO:68) 

To generate chhneric enzymes between Tth DN KX HI (H86A/G506KyQ509K) 
(here abbreviated TthAKK, SEQ ID NO:104) or Taq 4M G504 (here abbreviated Taq 
5M, SEQ ID NO:108), and Tfi DN 2M (SEQ ID NO:130), or Tsc DN 2M (SEQ ID 
N0:131), additional restriction endonuclease sites wrae introduced by site specific 
mutagenesis mto the named Tfi and Tsc mutants. Mutagenic primers 700-011-01 5'-cag 
acc atg aat tec acc cca ctt ttt gac ctg gag-3' (SEQ ID NO:334) and 700-01 1 -02 5'-gtg gac 
gcg gcc gcc cga ggc cgc cgc cag ggc cag-3' (SEQ ID NO:335) were used to introduce an 
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EcoRI site at amino acid position 1 and a NotI site at amino acid position 33 1 in Tfi DN 
2M. Mutagenic primers 700-01 1-03 5'<^ acc atg aat tec ctg ccc etc ttt gag ccc aag-3' 
(SEQ ID NO:336) and 700-01 1-04 5'-gta aac cgc gcc gcc cca ggc ggc ggc caa ggc gtt-3* 
(SEQ ID NO:337) were used to introduce an EcoRI site at amino acid position 1 and a 
5 NotI site at amino acid position 327 in Tsc DN 2M. PGR reactions wore done using the 
Advance cDNA PGR kit (Clonetech) according to manufacturer's instructions and either 
Tfi DN 2M or Tsc DN 2M as target, with their corresponding primers. The 1017 base 
pair PGR products were cut with both EcoRI and NotI to yield 993 base pair insert 
fragments that were gel purified as described in Example 3D. The mutants Taq4M 
10 G504K (SEQ ID NO:108) and Tth DN RX HT (H786A/G506K/Q509K) (SEQ ID . 
NO: 104) were also cut with EcoRI and NotI, and the larger, vector fi:agment was gel 
isolated as above. Ligations were performed as detailed in Example 3D, as was the 
screening and verification of the new constructs. 

IS EXAMPLE 9 

Additional Enzymes Iiaving Improved RNA-dependent 5* naclease activity 

Generation of Tfi DN 2M(AN) 
20 To facilitate later cloning steps, an endogenous restriction enzyme site (Not I) was 

removed from the polymerase region of the TfiPolDN2M gene (SEQ ID NO: 130 
described in Example 8A), and a unique Not I site was inserted in a more advantageous 
position. 

The endogeneous Not I site was removed as follows. The QUIKCHANGE 
25 Site-Directed Mutagenesis Kit from (Stratagene) was used according to manufacturer's 
instructions with the mutagenic primers 5'-gag-gtg-gag-cgg-ccc-ctc-tcc-cgg-gtc-ttg (SEQ . 
ID NO; 338) and 5'-caa-gac-ccg-gga-gag-ggg-ccg-ctc-cac-ctc (SEQ ID NO: 339). The 
new constmct was named Tfi DN 2M(AN) (DNA sequence SEQ ID NO: 340; amino acid 
sequence SEQ ID NO: 341). 

30 
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Generation of Tfl DN 2Ma^, Tsc DN 2M(N) 

To instaU a unique NotI site (at amino acid position 328) in Tfi DN 2M{AN) 
(SEQ JD NO: 340), primers 886-088-07 (SEQ ID NO: 342) S'-tgg-cgg-cgg-cct-cgg-gcg- 
gcc-gcg-tcc-acc-ggg-caa-ca-3' and 700-010-03 5'-ctt-ctc-tca-tcc-gcc-aaa-aca-gcc (SEQ 
ID NO: 343) were used in a PGR reaction with Tfi DN 2M(AN) (SEQ ID NO: 340) as 
template. The resulting PGR fiagment was purified and cut with the restriction enzymes 
NotI and San. The cut fragment was then Ugated into the TfiTthAKK (SEQ ED NO: 132, 
desraibed in example 8,Q construct which was also digested with NotI and Sail. The 
new construct was termed Tfi DN 2M(N) (DNA sequence SEQ ID NO: 345; amino acid 
sequence SEQ ID NO: 346). 

To introduce a Not I restriction aidonudease site into mutant TscPol DN 2M 
(previously described in Example 8B above), PGR was perfonned with primers: 886-088- 
05 (SEQ ID NO: 344) 5'-tgg-ccg-ccg-cct-ggg-gcg-gcc-gcg-ttt-acc-ggg-cgg-ag-3' and 
700-010-03 (SEQ ID NO: 343) 5'-ctt-ctc-tca-tcc-gcc-aaa-aca-gcc-3' using TscPolDN2M 
(SEQ ID NO: 131) as template. The Notl-SaU digested fragment of the PGR product was 
then sub-cloned into NotI and SaH digested TscTthAKK vector (SEQ ID NO: 133). The 
resulting construct was termed Tsc DN 2M(N) PNA sequence SEQ ED NO:347; amino 
acid sequence SEQ ED NO:348). 

Generation of Tfi DN 2M(N)AKK and Tsc DN2M(N)AKK 

To generate the "AKK" set of mutations (G504K/E507K/H784A/D785N), site 
specific mutagenesis was done on the Tfi DN 2M(N) mutant (DNA sequence SEQ ID 
NO: 345), using primers 959-022-01 to -04: 5'-ggc-ctc-acc-ccg-gtg-aag-cgg-acg-aag-aag- 
acg-ggc-aag-cgc-3', 5'-gcg-ctt-gcc-cgt-ctt-ctt-cgt-ccg-ctt-cac-cgg-ggt-gag-gcc-3', 5 '-ctc- 
ctc-ctc-caa-gtg-gcc-aac-gag-ctg-gtc-ctg-3',5'-cag-gac-cag-ctc-gtt-ggc-cac-ttg-gag-gag. 
gag-3' (SEQ ID NO: 354-357) to generate the Tfi 2M(N)AKK mutant (DNA sequence 
SEQ ID NO: 358; amino acid sequence SEQ ED NO: 359). This construct is termed 
"TfiAKK". 

To mstall the "AKK" set of mutations (G502K/E505K/H782A/D783N), into the 
TscDN 2M (N) construct, pNA sequence SEQ ID NO: 347) primers 959-022-05 to -08: 
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5 '-ggg-ctt-ccc-gcc-atc-aag-aag-acg-aag-aag-acg-ggc-aag-cgc-3', 5'-gcg-ctt-gcc-cgt-ctt- 
ctt-cgt-K5tt-K)tt-gat-ggc-ggg-aag-Kxx-3^5'-atg-ctt-ttg-cag-gtg-gc^^ 
5'-gag-gac-cag-ttc-gtt-ggC"Cac-ctg-caa-aag-cat-3' (SEQ ID NO: 360-363) were used to 
generate Tsc2M(N)AKK (DNA sequence SEQ ID NO: 364; amino acid sequence SEQ 
5 ID NO: 365). This construct is termed 'Tsc AKK". 

Constniction of point mutants by recombinant PCR 
Construction of TthAKK(P195A) and TthAKK(P195K) 

To introduce mutations at amino acid position 195 (either a P195A or P195K) in 

10 the nuclease domain of TthAKK construct, mutagenic primer 785-073-01 (P195A) 5'- 
ccc-tcc-gac-aac-ctc-gcc-ggg-gtc-aag-ggc-atc-3' (SEQ ID NO: 370) or 785-073-02 
(P195K) 5'-ccc-tcc-gac-aac-ctc-aag-ggg-gtc-aag-ggc-atc-3* (SEQ ID NO: 371) and 
primer 209-074-02: 5'-gtg-gcc-tcc-ata-tgg-gcc-agg-ac-3' (SEQ ID NO:316) were used in 
a PCR reaction to generate a 787 base pair fragment. Another PCR fragment was 

15 obtained by using the primers: 390-076-08 5'-tgt-gga-att-gtg-agc-gg-3* (SEQ ID 

NO:314) and 785-073-03 5'-gag-gtt-gtc-gga-ggg-gtc-3' (SEQ ID NO: 372) in a reaction 
with the same template. 

The two PCR fragments overlap and were combined in a recombinant PCR 
reaction. The outside primers 390-076-08 and 209-074-02 were added, and the 

20 Advantage cDNA PCR kit (Clontech) was used according to manufacturer's instructions. 
A 1380 base pair product was generated from this reaction. 

The recombinant PCR product was cut with the restriction enzymes EcoRI and 
NotI to yield a 986 base pair fragment. The TthAKK construct was prqpared by cuttmg 
with the same enzymes. The fragment was then ligated into the vector, and transformed 

25 into IM109 cells. New mutants developed from this set of reactions include: 

TthAKK(P195A) (DNA sequence SEQ ID NO: 373; ammo acid sequence SEQ ED NO: 
374) and TthAKK(PI95K) (DNA sequence SEQ ID NO: 375; amino acid sequence SEQ 
ID NO: 376). 

Construction of TthAKK(N417K/L418K) 
30 The same approach was used to constmct TthAKK(N417K/L418K). Two 

overlapping PCR fragments were generated by mutagenic primers: 785-73-07 5'-gag- 
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agg-cUHMt-Cgg-aag-aag-ctt-aag-cgc-ctcgag-S' (SEQ ID NO: 377) and 700-10-03 5'-ctt- 
ctctca-tec-gco-aaa-aca-gcc-3' (SEQ JD NO:343). and primers 158-084-01 5'-ctc-ctc-cac- 
gag-tto-ggc-3' (SEQ ID NO:310) and 785-73-08 5*-ccg-atg-gag-cct-ctocga-3' (SEQ ID 
NO: 378). The two products were combined and amplified with outside primers 700-10- 
03 and 158-084-01. The recombmant PGR product was cut with the restriction enzymes 
Nofl and BamHI and ligated into the Notl/BaniHI pre-cut T&AKK construct. This 
mutant was termed TthAKK(N417K/L418K) (DNA sequence SEQ ID NO: 379; amino 
acid sequence SEQ ID NO: 380). 

Construction of additional TthAKK point mutants by site-directed mutagenesis 
Construction of TthAKK(P25SL) 

Site specific mutagenesis was perfomied on the TthAKK construct using the 
mutagenic primer 886-049-05 and 886-049-06: 5'-gtg-cgc-acc-gac-ctc-cto^;tg-gag-gtg- 
gac-ctc-3' (SEQ ID NO: 381), 5'-gag-gtc-cac-cto-cag-gag-gag-gtc-ggt-gcg-cac-3' (SEQ 
ID NO: 382) to generate TthAKK(P255L) (DNA sequence SEQ ID NO: 383; amino acid 
sequence SEQ ID NO: 384). 



Construction of TtbAKK(F311Y) 

Site specific mutagenesis was performed on the TthAKK construct using the 
20 mutagenic primer 886-049-09 and 886-049-10: 5*-ggg-gcc-ttc-gtg-ggc-tac-gtc-ctc-tcc- 
cgc-ccc-3' (SEQIDNO:P385), 5'-ggg-gcg-gga-gag-gac-gta-gcc-cac-gaa-ggc-ccc-3' 
(SEQ ID NO: 386) to generate TthAKK(F3 11 Y) (DNA sequence SEQ ID NO: 387; 
amino acid sequence SEQ ID NO: 388). 



25 Construction of TtbAKK(N221H/R224Q) 

Site specific mutagenesis was performed on the TthAKK construct using the 
mutagenic primer 886-049-01 and 886-049-02: S'-gaa-aac-ctc-ctc-aag-cac-ctg-gac-cag- 
gta-aag-cca-gaa-aao-3' (SEQ ID NO: 389), 5'-gtt-ttc-tgg-ctt-tac-ctg-gtc-cag-gtg-ctt-gag- 
gag-gtt-ttc-3' (SEQ ID NO: 390) to generate TthAKK(N221H/R224Q) (DNA sequence 
30 SEQ ID NO: 391 ; amino acid sequence SEQ ID NO: 392). 
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Construction of TthAKK(R2SlH) 

Site specific mutagenesis was performed on TthAKK construct using the 
mutagenic primer 886-049-03 and 886-049-04: 5'-gag-ctc-tcc-cgg-gtg-cac-acc-gac-ctc- 
ccc-ctg-3' (SEQ ID NO: 393), 5'-cag-ggg-gag-gtc-ggt-gtg-cac-ccg-gga-gag-ctc-3' (SEQ 
5 ID NO: 394) to generate TthAKK(El251H) (DNA sequence SEQ ID NO: 395; amino acid 
sequence SEQ ID NO: 396). 

Construction of TthAKK(P2SSL/R2SlH) 

Site specific mutagenesis was performed on the TthAKK(P255L) construct using 
10 the mutagenic primer 886-088-01 and 886-088-02: 5'-gag-ctc-tcc-cgg-gtg-cac-acc-gac- 
ctc-ctc-ctg-3' (SEQ ID NO: 397), 5'-cag-gag-gag-gtc-ggt-gtg-cac-ccg-gga-gag-ctc-3' 
(SEQ ID NO: 398) to generate TthAKK(P255L/R251H) (DNA sequence SEQ ID NO: 
399; amino acid sequence SEQ ID NO: 400). 

15 Construction of Tth AKK L429V (DNA sequence SEQ ID NO: 401; amino 

acid sequence SEQ ID NO: 402) 

Palm region randomization mutagenesis was p^onned on the Tth AKK construct 

using the mutagenic primer 680-79-02 5 '-etc cat egg aac etc ctt [aag cgc etc gag ggg gag 

gag aag etc ctt tgg] etc tac cac gag gtg-3' (SEQ ID NO: 403) and reverse primer 680-79- 
20 04 5*-AAG GAG GTT CCG ATG GAG-3' (SEQ ID NO: 404) to introduce the random 

mutations in the Pahn region. Brackets indicate a synthesis of 91% base shown and 3% 

all other bases. 

Construction of Tth AKK E425V PNA sequence SEQ ID NO: 405; amino 
acid sequence SEQ ID NO: 406) 

25 Pahn region randomization mutagenesis was performed on the Tth AKK construct 

using the mutagenic primer 680-79-02 5*-ctc cat egg aac etc ctt [aag cgc etc gag ggg gag 
gag aag etc ctt tgg] etc tac cac gag gtg-3 ' (SEQ ID NO: 403) and reverse primer 680-79- 
04 5'-AAG GAG GTT CCG ATG GAG-3' (SEQ ID NO: 404) to introduce the random 
mutations in the Pahn region. Brackets indicate a synthesis of 91% base shown and 3% 

30 all other bases. 
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Construction of Tth AKK L422N/E425K (DNA sequence SEQ ID NO: 407; 
amino acid sequence SEQ ID NO: 408) 

Palm region randomization mutagenesis was performed on the Tth AKK construct 
using the mutagenic primer 680-79-02 5'-ctc cat egg aac etc ctt [aag cgc etc gag ggg gag 
gag aag etc ctt tgg] etc tac cac gag gtg-3' (SEQ ID NO: 403) and reverse primer 680-79- 
04 5'-AAG GAG GTT CCG ATG GAG-3' (SEQ ID NO: 404) to introduce the random 
mutations in the Pabn region. Brackets indicate a synthesis of 91% base shown and 3% 
all ofhor bases. 

Construction of Tth AKK L422FAV430C (DNA sequence SEQ ID NO: 409; 
amino add sequence SEQ ID NO: 410) 

Pahn region randomization mutagenesis was performed on Tth AKK DNA using 
the mutagenic primer 680-79-02 5'-ctc cat egg aac etc ctt [aag cgc etc gag ggg gag gag 
aag etc ctt tgg] etc tac cac gag gtg-3' (SEQ ID NO: 403) and reverse primer 680-79-04 
5'-AAG GAG GTT CCG ATG GAG-3' (SEQ ID NO: 404) to mtroduce the random 
mutations in the Palm region. Brackets indicate a synthesis of 91% base shown and 3% 
all otiier bases. 

Construction of Ttti AKK A504F (DNA sequence SEQ ID NO: 411; amino 
acid sequence SEQ ID NO: 412) 

Site saturation mutagenesis was performed on Tth AKK construct using the 
mutagenic primer 680-80-03 5'-cag gag ctt agg ctt ccc ram ttg aag aag acg aag aag aca-3' 
(SEQ ID NO: 413) and reverse primer 680-80-06 5'-cct aag etc gtc aaa gag-3' (SEQ ID 
NO: 414) to introduce flie random mutations in the A504 amino acid. 

Construction of Ttii AKK A504V (DNA sequence SEQ ID NO: 415; amino 
acid sequence SEQ ID NO: 416) 

Site saturation mutagenesis was performed on Tth AKK construct using the 
mutagenic primer 680-80-03 5'-cag gag ctt agg ctt ccc nnn ttg aag aag acg aag aag aca-3' 
(SEQ ID NO: 413) and reverse prima: 680-80-06 5'-cct aag etc gtc aaa gag-3' (SEQ ID 
NO: 414) to introduce the random mutations in the A504 amino acid. 
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Construction of Tth AKK A504S (DNA sequence SEQ ID NO: 417; amino 
acid sequence SEQ ID NO: 418) 

Site saturation mutag^esis was performed on Tth AKK construct using the 
5 mutagenic primer 680-80-03 5'-cag gag ctt agg ctt ccc mm ttg aag aag acg aag aag aca-3* 
(SEQ ID NO: 413) and reverse primer 680-80-06 5'-<:ct aag etc gtc aaa gag-3* (SEQ ID 
NO: 414) to introduce the random mutations in the AS04 amino acid. 

Construction of Tth AKK S517G (DNA sequence SEQ ID NO: 419; amino 
10 acid sequence SEQ ID NO: 420) 

Site saturation mutagraesis was performed on Tth AKK construct using the 
mutagenic primer 680-80-07 5'-GGC AAG CGC TCC AGO NNN GCC GCG GTG CTG 
GAG GCC CTA CGG-3' (SEQ ID NO: 421) and reverse primer 680-80-10 5'-GGT 
GGA GCG CTT GCC-3* (SEQ ID NO: 422) to introduce the random mutations in the 
15 SS17 amino acid. 

Construction of Ttli AKK AS18L (DNA sequence SEQ ID NO: 423; amino 
acid sequence SEQ ID NO: 424) 

Site saturation mutagenesis was perfomied on Tth AKK construct using the 
20 mutagenic primer 680-80-07 5'-GGC AAG CGC TCC ACC AGC NNN GCG GTG CTG 
GAG GCC CTA CGG-3' (SEQ ID NO: 425) and reverse primer 680-80-10 5'-GGT 
GGA GCG CTT GCC-3* (SEQ ID NO:422) to introduce the random mutations in the 
AS18 amino acid. 

25 Construction of Tth AKK AS18R (DNA sequence SEQ ID NO: 426; amino 

acid sequence SEQ ID NO: 427) 

Site saturation mutagenesis was performed on Tth AKK construct using the 
mutagenic primer 680-80-07 5'-GGC AAG CGC TCC ACC AGC NNN GCG GTG CTG 
GAG GCC CTA CGG-3* (SEQ ID NO: 425) and reverae primer 680-80-10 5'-ggt gga 
30 gcg ctt gcc-3 * (SEQ ID NO: 422) to introduce the random mutations in the A5 1 8 amino 
acid. 
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Construction of TaqSM L4S1R (DNA sequence SEQ ID NO: 428; amino acid 

sequence SEQ ID NO: 429) 

Site directed mutagenesis was performed on the Taq 5M construct using the 
mutagenic primer 240-60-05 S'-acg-ggg-gtg-cgc-cgg-gac-gtg-gcc-tat 3' (SEQ ID NO: 
430) to introduce the L451R mutation in the Taq 5M enzyme. 

Construction of Tth AKK A504K (DNA sequence SEQ ID NO: 431; amino 
acid sequence SEQ ID NO: 432) 

Site directed mutagenesis was performed on the Tth AKK construct using the 
mutagenic primer 680-69-04 S'-ctt agg ctt ccc aag ttg aag aag acg aag aag aca-3' (SEQ 
ID NO: 433) and reverse primer 680-69-05 5'-tgt ctt ctt cgt ctt ctt caa ctt ggg aag cct aag- 
3' (SEQ ID NO: 434) to introduce the A504K mutation in the Tth AKK enzyme. 

Construction of Tth AKK H641A (DNA sequence SEQ ID NO: 435; amino 
acid sequence SEQ ID NO: 436) 

Site directed mutagenesis was performed on Tth AKK construct using the 
mutagenic primer 680-69-08 5'-gag ggg aag gac ate gcc acc cag acc gca agc-3' (SEQ ID 
NO: 437) and reverse primer 583-01-02 5'-gct tgc ggt ctg ggt ggc gat gtc ctt ccc etc- 3' 
(SEQ ID NO: 438) to introduce the H641 A mutation in the Tth AKK eazyme. 

Construction of Tth AKK T508P (DNA sequence SEQ ID NO: 439; amino 
acid sequence SEQ ID NO: 440) 

Site directed mutagenesis was performed on TthAKK construct using the 
mutagenic primer 680-70-01 5'-ccc-gcc ttg aag aag ccg aag aag aca ggc aag-3' (SEQ ED 
NO: 441) and reverse primer 680-70-02 5'-ctt gcc tgt ctt ctt egg ctt ctt caa ggc ggg-3' 
(SEQ ID NO: 442) to introduce the T508P mutation in the Tth AKK enzyme. 

Chimeras and mutations in chimeras. 

Fusion between TthAKK enzyme and alpha-peptide 
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A TthAKK-lacZ-alpha-peptide chimeric fusion was constructed to allow detection 
of mutations (including frame-shifts, deletions, insertions, etc.) which cause the inability 
of expression of the full-length fusion protein based on the colony blue-white screening 
(Wu et al, Nucleic Acids Research, 24:1710 [1996]). 

5 Site specific mutagenesis was performed on TthAKK DNA using the mutagenic 

primers 959-041-03, 5'-cac-cac-cac-cac-cac-cac-gtc-gac-tag-tgc-tag-cgt-cga-cta-gct-gca- 
ggc-atg-caa-gct-tgg-c-3' (SEQ ID NO: 477) and 959-041-04, 5*-gcc-aag-ctt-gca-tgc-ctg- 
cag-cta-gtc-gac-gct-agc-act-agt-cga-cgt-ggt-ggt-ggt-ggt-ggt-g-3* (SEQ ID NO: 478) to 
generate pTthAKK-L with Sail site following the 6xHis tag at the C-terrainus of TthAKK 

10 for the insertion of lacZ alpha peptide. The alpha peptide (of 201 amino acids) was first 
PGR amplified fi:om the pCRII-TOPO vector (Invitrogen) witii primers 959-041-01, 5'- 
cag-gaa-gcg-gcc-gcg-tcg-aca-tga-cca-tga-tta-cgc-caa-gc-3' (SEQ ID NO: 479) and 959- 
093-01, 5'-ggg-ccc-gcc-agg-gtc-gac-tca-ggg-cga-tgg-ccc-act-acg-tga-3' (SEQ ID NO: 
480). The PGR product was then digested with restriction enzyme Sail and ligated into 

15 the pTthAKK-L vector, which was also cut with the same enzyme to generate the 

chimeric construct TthAKK-alpha peptide (DNA sequence SEQ ID NO: 481; amino acid 
sequence SEQ ID NO: 482). The orientation of the insert was confirmed by sequencing. 

Construction of TthTscAKK and TthTfiAKK enzymes 

20 The TthAKK construct was cut with the enzymes EcoRI and NotI and the smaller 

insert fragment was gel isolated. The TscAKK or TfiAKK constructs were also cut with 
EcoRI and NotI and the larger fragment was gel isolated and purified. The Tth insert 
(nuclease domain) was ligated into the TscAKK and TfiAKK vectors (polymerase 
domain) to generate TthTscAKK (DNA sequence SEQ ID NO: 447; amino acid sequence 

25 SEQ ID NO: 448) and TthTfiAKK pNA sequence SEQ ID NO: 449; amino acid 
sequence SEQ ID NO: 450) chimeric constructs. 



FT mutations of Tth polymerase to improve substrate specificity 
Construction of Taq(FT)TthAKK 

30 Site specific mutagenesis was performed on the TaqTthAKK construct using the 

mutagenic primer 473-087-05: 5'-cgg-gac-ctc-gag-gcg-cgt-gaa-ccc-cag-gag-gtc-cac-3' 



226 



wo 01/90337 



PCT/USOl/17086 



(SEQ ED NO: 483) to introduce the L107F/A108T mutations and generate 
Taq(FT)TthAKK (DNA sequence SEQ ID NO: 484; amino acid sequence SEQ ID NO: 
485). 

Construction of Tfi(Fr)TtliAKK 

Site specific mutagenesis was performed on the TfiTthAKK construct using the 
mutagenic primers 785-096-01: 5'-gtg-gac-ctt-ctg-ggc-ttt-acc-cgc-ctc-gag-gcc-ccg-3' 
(SEQ ID NO: 486) and 785-096-02: 5'-cgg-ggc-ctc-gag-gcg-ggt-aaa-gcc-cag-aag-gtc- 
cac-3' (SEQ ID NO: 487) to introduce the L107FA^108T mutations and generate 
Tfi(FT)TthAKK PNA sequence SEQ ID NO: 349; amino acid sequence SEQ ID NO: 
488). 

Tfi(FT) DN 2M(N) and Tsc(FT) DN 2M(N) mutants 

The L107FAf'108T mutations were introduced by isolating the NotI and Sail 
fragment of the Tfi DN 2M(N) mutant (DNA sequence SEQ ID NO: 345) and inserting it 
into a Notl-Sall pre-digested Tfi(FT)TthAKK (DNA sequence SEQ ID NO: 349; amino 
acid sequence SEQ ID NO: 488) to yield the Tfi(FT) DN 2M(N) mutant (DNA sequence 
SEQ ID NO: 350, amino acid sequence SEQ ID NO: 35 1). To add the L107F or the 
E108T mutations into this Tsc-based construct, an identical procedure was done. The 
NotI and Sail cut fragement of Tsc DN 2M(N) mutant pNA sequence SEQ ID NO: 348) 
was inserted into Notl-Sall pre-digested Tsc(FT)TthAKK (DNA sequence SEQ ID NO: 
491) vector to yield Tsc(FT) DN 2M0SI) mutant (DNA sequence SEQ ID NO: 352, amino 
acid sequence SEQ ID NO: 353). 

Construction of Tfi(FT) AKK and Tsc(FT) AKK 

Starting with flie Tfi^DN2M(N) construct described previously (DNA 
sequence SEQ ID NO: 350), primers (959-022-01 to -04: 5'-ggc-ctc-acc-ccg-gtg-aag- 
cgg-acg-aag-aag-acg-ggo-aag-cgc-3',5'-gcg-ctt-gcc-cgt-ctt-ctt-cgt-ccg-ctt-cac-cgg-ggt- 
gag-gcc-3',5'-ctc-ctc-ctc-caa-gtg-gcc-aac-gag-ctg-gtc-ctg-3",5'-cag-gac-cag-ctc-gtt-ggc- 
cac-ttg-gag-gag-gag-3' (SEQ ID NO: 354-357) were used to introduce the "AKK" set of 
mutations (H784A, G504K and E507K) by site specific mutagenesis. The resulting 
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mutant construct is termed Tfi(FT)AKK (DNA sequence SEQ ID NO:366, amino acid 
sequence SEQ ID NO:367). 

Likewise, primers 959-022-05 to -08: 5'-ggg-ctt-ccc-gcc-atc-aag-aag-acg-aag- 
aag-acg-ggc-aag-cgc-3', 5*-gcg-ctt-gcc-cgt-ctt-ctt-cgt-ctt-ctt-gat-ggc-ggg-aag-ccc-3', 5 
5 atg-ctt-ttg-cag-gtg-gcc-aac-gaa-ctg-gtc-ctc-3', 5 '-gag-gac-cag-ttc-gtt-ggc-cac-ctg-caa-aag- 
cat-3* (SEQ ID NO: 360-363) were used in a site specific mutagenesis reaction, with the 
Tsc(FT)DN2M(N) mutant (DNA sequence SEQ ID NO: 352) as template to generate the 
Tsc(FT)AKK mutant (DNA sequence SEQ ID NO: 368; amino acid sequace SEQ ID 
NO: 369). 

10 Construction of Tsc(FT)TthAKK 

Site specific mutagenesis was performed on the TscTthAKK construct using the 
mutagenic primers 785-008-03 5*-ttt-acc-cgc-ctc-gag-gtg-ccg-ggc-3* (SEQ ID NO: 489) 
and reverse primer 680-21-03 5'-cgg cac etc gag gcg ggt aaa gcc caa aag gtc cac-3' (SEQ 
ID NO: 490) to introduce the L107F/E108T mutations and generate Tsc(FT)TthAKK 

15 (DNA sequence SEQ ID NO: 454; amino acid sequence SEQ ID NO: 491). 
ConstructioD of Taq(FT)TscAKK and Taq(FT)TfiAKK en2^es 

The Taq(FT)TthAKK construct was cut with the enzymes EcoRI and NotI and the 
smaller insert fragment was gel isolated* The TscAKK or TfiAKK constructs were also 
cut with EcoRI and NotI and the larger fragment was gel isolated and purified. The 

20 Taq(FT) insert (nuclease domain) was ligated into the TscAKK and TfiAKK vectors 
(polymerase domam) to generate the Taq(FT)TscAKK (DNA sequence SEQ ID NO: 
443; amino acid sequence SEQ ID NO: 444) and Taq(FT)TfiAKK (DNA sequence SEQ 
ID NO: 445; amino acid sequence SEQ ID NO: 446) chimeric constructs. 
Construction of TaqEFT-Tth(AKK) (DNA sequence SEQ ID NO:501; amino 

25 acidsequence SEQ ID NO: 502) 

Site specific mutagenesis was performed on Taq(FT)-Tth(AKK) DNA (SEQ ID 
NO: 484) using the mutagenic primers 436-013-08: 5 -atc-gtg-gtc-ttt-gac-gcc-gag-gcc- 
ccc-tcc-ttc-c-3* (SEQ ID NO:503) and 436-013-09 5 -gga-agg-agg-ggg-cct-cgg-cgt-caa- 
aga-cca-cga-t-3* (SEQ ID NO:504) to introduce the K70E mutation. 

30 

Additional mutations to improve enzyme activity of Taq(EFr)TthAKK 
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Constnicaon of TaqEFr-Tth(AKIQ-A/Ml (DNA sequence SEQ ID NO:505; amino 
acid sequence SEQ ID NO: 506) 

Site specific mutagenesis was perfonned on Taq(EFT)-Tth(AKK) DNA (SEQ ID 
NO: 501) using the mutagenic primers 1044-038-01: S'-cag-acc-atg-aat-tcg-gag-gcg-atg- 
ctg-ccc-ctc-ttt.3' (SEQ ID NO:507) and 1044-038-02: (SEQ ID NO: 508) 
5*-aaa-gag-ggg-cag-cat-cgc-ctc-cga-att-cat-ggt-ctg-3' to introduce the G4EA mutation. 

Construction of TaqEFr-m(AKK)-B/M2 (DNA sequence SEQ ID NO;509; 
amino acid sequence SEQ ID NO: 510) 

Site specific mutagenesis was performed on Taq(EFr)-Tth(AKK) DNA (SEQ ID 
NO: 501) using the mutagenic primers 1044-038-03: 5'-gco-tac-cgc-acc-ttc-ttt-gcc-ctg- 
aag-ggc-ctc-3 and 1044-038-04: 5'-gag-gcc-ctt-Kag-ggc-aaa-gaa-ggt-gcg-gta-ggc-3' (SEQ 
ID NO: 511 and 512, respectively) to introduce the H29F mutation. 

Construction of TaqEFT-Tth(AKK)-C/M3 (DNA sequence SEQ ID NO:513; 
amino acid sequence SEQ ID NO: 514) 

Site specific mutagenesis was performed on Taq(EFT)-Tth(AKK) DNA (SEQ ID 
NO: 501) using the mutagenic primers 1044-038-05: 5'-ctc-cto-aag-gpc-ctc.aga-gag-gac- 
ggg-gac-gcg-3' and 1044-038-06: 5'-cgo-gtc-ccc-gtc-ctc-tct-gag-ggo-ctt-gag-gag-3' (SEQ 
ID NO: 515 and 516, respectively) to introduce the K57R mutation. 
Construction of TaqEFT-Tth(AKK>D/M5 (DNA sequence SEQ ID NO:517; 
amino acid sequence SEQ ID NO:518) 

Site specific mutagenesis was performed on Taq(EFT)-Tth(AKK) DNA (SEQ ID 
NO: 501) using the mutagenic primers 1044-038-09: 5'-gac-gac-gtc-ctg-gcc-acc-ctg-gcc- 
aag-aag-gcg-3* and 1044-038-10: 5'-cgc-ctt-ctt-ggc-cag-ggt-ggo-cag-gac-gtc-gtc-3' 
(SEQ ID NO: 519 and 520, respectively) to introduce the S125T mutation. 
Construction of TaqEFT-Tth(AKK)-E/M6 (DNA sequence SEQ ID NO:521; 
amino acid sequence SEQ ID NO: 522) 

Site specific mutagenesis was performed on Taq(BFT)-Tth(AKK) DNA (SEQ ID 
NO: 501) using the mutagenic primers 1044-038-11: 5'-ggg-gag-aag-acg-gcg-ctc-aag-ctt- 
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ctg-gag-gag-3' and 1044-038-12: 5'-ctc-ctc-cag-aag-ctt-gag-cgc-cgt-ctt-ctc-ccc-3* (SEQ 
ID NO: 523 and 524, respectively) to introduce the R206L mutation. 

Construction of TaqEFT-Tth(AKK)-F/M7 (DNA sequence SEQ ID NO:525; amino 
5 acid sequence SEQ ID NO: 526) 

Site specific mutagenesis was perfoimed on Taq(BFT)-Tth(AKK) DNA (SEQ ID 
NO: 501) using the mutagenic primers 1044-038-13: 5'-gag-cco-gac-cgg-gag-ggg-ctt- 
aag-gcc-ttt-ctg-gag-agg-3' and 1044-038-14: 5'-cct-ctc-cag-aaa-ggc-ctt-aag-ccc-ctc-ccg- 
gtc-ggg-ctc-3 (SEQ ID NO: 527 and 528, respectively) to introduce the R269G and 
10 R27 IK mutations. 

Construction of TaqEFT-Tth(AKK)-G/M8 (DNA sequence SEQ ID NO;S29; amino 
acid sequence SEQ ID NO: 530) 

Site specific mutagenesis was performed on Taq(EFT)-Tth(AKK) DNA (SEQ ID 
NO: 501) using the mutagenic primers 1044-038-15: 5'-cac-gag-ttc-ggc-ctt-ctg-gga-ggg- 
15 gag-aag-ccc-cgg-gag-gag-gcc-ccc-tgg-ccc-3' and 1044-038-16: 5'-ggg-cca-ggg-ggc-ctc- 
ctc-ccg-ggg-ctt-ctc-ccc-tcc-cag-aag-gcc-gaa-ctc-gtg-3' (SEQ ID NO: 531 and 532, 
respectively) to introduce the E290G, S291G, P292E, A294P, and L295R mutations. 

Construction of TaqEFr-Ttli(AKIQ-H/M9 (DNA sequence SEQ ID NO:533; amino 
20 acid sequence SEQ ID NO: 534) 

Site specific mutagenesis was performed on Taq(EFT)-Tth(AKK) DNA (SEQ ID 
NO: 501) using the mutagenic primers 1044-038-17: 5'-ctg-gcc-ctg-gcc-gcc- tgo-agg- 
ggc-ggc-cgc-gtg-3* and 1044-038-18: 5'-cac-gcg-gcc-gcc-cct-gca-ggc-ggc-cag-ggc-cag-3' 
(SEQ ID NO: 535 and 536, respectively) to introduce the A328C mutation. 

25 

Construction of TaqEFT-Tth(AKIQ-I/M10 (DNA sequence SEQ ID NO:S37; amino 
acid sequence SEQ ID NO: 538) 

Site specific mutagenesis was performed on Taq(EFT)-Tth(AKK) DNA (SEQ ID 
NO: 501) using the mutagenic primers 1080-015-01 (SEQ ID NO:539) 5'-ggg gag aag 
30 acg gcg agg aag ctt ctg aag gag tgg ggg agc-3' and 1080-015-02 (SEQ ID NO: 540) 5'- 
gct ccc cca etc ctt cag aag ctt cct cgc cgt ctt etc ccc-3 ' to introduce the E210K mutation. 
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Constructioii of TaqEFT-Tth(AKK)-Ml-9 (DNA sequence SEQ ID NO:541; amino 
add seqaence SEQ ID NO: 542) 

Seven independent PCR reactions were performed, using construct TaqEFT- 
Tth(AKK) (SEQ ID: 501) as a template, with the following pairs of mutagenic primers: 
PCR Reaction 1; 1044-038-01 5'-cag-acc-atg-aat-tcg-gag-gcg-atg-ctg-ccc-ctc-ttt-3' (SEQ 
IDNO:507) and 1044-038-04 5'-gag-gcc-ctt-cag-ggc-aaa-gaa-ggt-gcg-gta-ggc-3' (SEQ 
ID NO:512) to yield a 108 base pair fragment, PCR Reaction 2; 1044-038-03 5'-gcc-tac- 
cgc-acc-ttc-ttt-gcc-ctg-aag-ggc-ctc-3' (SEQ ID NO: 51 1) and 1044-038-06 5'-cgc-gtc- 
ccc-gtc-ctc-tct-gag-ggo-ctt-gag-gag-3' (SEQ ID NO: 516) to yield a 1 17 base pair 
ftagment, PCR Reaction 3; 1044-038-05 5*-ctc-ctc-aag-gcc-ctc-aga-gag-gac-ggg-gac- 
gcg-3' (SEQ ID NO: 515) and 1044-038-10 5'-cgc-ctt-ctt-ggc-cag-ggt-ggc-cag-gac-gto- 
gto-3' (SEQ ED NO: 520) to yield a 237 base pair fragment, PCR Reaction 4; 1044-038- 
09 5'-gac-gac-gto-ctg-gcc-acc-ctg-gcc-aag-aag-gcg-3' (SEQ ID NO: 519) and 1044-038- 
12 5'-ctc-ctc-cag-aag-ctt-gag-cgp-cgt-ctt-ctc-ccc-3' (SEQ ID NO: 524) to yield a 276 base 
pair fragment, PCR Reaction 5; 1044-038-11 5'-ggg-gag-aag-acg-gcg-ctc-aag-ctt-ctg- 
gag-gag-3' (SEQ ID NO: 523) and 1044-038-14 5'-cct-ctc-cag-aaa-ggc-ctt-aag-ccc-ctc. 
ccg-gtc-ggg-ctc-3' (SEQ ED NO: 528) to yield a 228 base pair fragment. PCR Reaction 6; 
1044-038-13 5'-gag-ccc-gac-cgg-gag-ggg-ctt-aag-gcc-ttt-ctg-gag-agg-3' (SEQ ID NO: 
527) and 1044-038-16 5'-ggg-ccarggg-ggc-ctc-ctc-ccg-ggg-ctt-ctc-ccc-tcc-cag-aag-gcc- 

gaa-ctc-gtg-3' (SEQ ID NO: 532) to yield a 1 13 base pair fragment, PCR Reaction 7; 
1044-038-15 5'-cac-gag-ttc-ggc-ctt-ctg-gga-ggg-gag-aag-cco-cgg-gag-gag-gcc-ccc-tgg- 
ccc-3' (SEQ ID NO: 531) and 1044-038-18 5'-cac-gcg-gcc-^cct-gca-ggc-ggc-cag-ggc- 
cag-3' (SEQ ID NO: 536) to yield a 157 base pair fragment The seven PCR products 
overly such that PGR aii?)lification in the aibsaice of primors yielded the ^ropriate 
1005 base pair product to introduce the K70E, G4EA, H29F. K57R. S125T, R206L, 
R269G, R271K, E290G, S291G, P292E, A294P, L295R, and A328C mutations. The 
product was furttier amplified using the outside primers: 1044-038-1 (SEQ ID NO: 507) 
and 1044-038-18 (SEQ ID NO: 536) and cloned into pPCR-Script-Amp. From this 
construct, the nuclease domain was again amplified using primers: 1044-038-1 (SEQ ID 
NO: 507) and 1044-038-18 (SEQ ID NO: 536) and digested with EcoRI and Notl. The 
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digested PGR product was ligated into an EcoRI and NotI digested TaqEFT-Tth(AKK) 
construct and transformed into JM109 for protein expression and screening. 

Constniction of Taq£FT-Tth(AKK)-Ml-10 (DNA sequence S£Q ID NO:543; amino 
5 acid sequence SEQ ID NO: 544) 

Seven independent PGR reactions were performed, using construct TaqEFT- 
Tth(AKK) (SEQ ID: 501) as a template, with the following pairs of mutagmc primers: 
PGR Reaction 1; 1044-038-01 5'-cag-acc-atg-aat-tcg-gag-gcg-atg-ctg-ccc-ctc-ttt-3' (SEQ 
E)NO:507) and 1044-038-04 5'-gag-gcc-ctt-cag-ggc-aaa-gaa-ggt-gcg-gta-ggc-3* (SEQ 

10 ID NO:512) to yield a 108 base pair fragment, PGR Reaction 2; 1044-038-03 5'-gcc-tac- 
cgc-acc-ttc-ttt-gcc-ctg-aag-ggc-^tc-3" (SEQ ID NO: 511) and 1044-038-06 5'-cgc-gtc- 
ccc-gtc-ctc-tct-gag-ggc-ctt-gag-gag-3' (SEQ ID NO: 516) to yield a 1 17 base pair 
fragment, PGR Reaction 3; 1044-038-05 5'-ctc-ctc-aag-gcc-ctc-^a-gag-gac-ggg-gac- 
gcg-3' (SEQ ID NO: 515) and 1044-038-10 5'-cgc-ctt-ctt-ggc-cag-ggt-ggc-cag-gac-gtc- 

15 gtc-3* (SEQ ID NO: 520) to yield a 237 base pair fragment, PGR Reaction 4; 1044-038- 
09 5'-gac-gac-gtc-ctg-gcc-acc-ctg-gcc-aag-aag-gcg-3* (SEQ ID NO: 519) and 1080-42-02 
5'-gc ttc cag get ccc cca etc ctt cag aag ctt gag cgc cgt ctt etc ccc-3' (SEQ ID NO: 546) to 
yield a 299 base pair fragment, PGR Reaction 5; 1080-42-01 5'-ggg gag aag acg gcg etc 
agg ctt ctg aag gag tgg ggg age ctg gaa gc-3' (SEQ ID NO: 545) and 1044-038-14 5'-cct- 

20 ctc-cag-aaa-ggc-ctt-aag-ccc-ctc-ccg-gtc-ggg-ctc-3' (SEQ ID NO: 528) to yield a 228 base 
pair fragment, PGR Reaction 6; 1044-038-13 5'-gag-ccc-gac-cgg-gag-ggg-ctt-aag-gcc-ttt- 
ctg-gag-agg-3' (SEQ ID NO: 527) and 1044-038-16 5'-ggg-cca-ggg-ggc-ctc-ctc-ccg-ggg- 
ctt-ctc-ccc-tcc-cag-aag-gcc-gaa-ctc-gtg-3' (SEQ ID NO: 532) to yield a 113 base pair 
fragment, PGR Reaction 7; 1044-038-15 5*-cac-gag-ttc-ggc-ctt-ctg-gga-ggg-gag-aag-ccc- 

25 cgg-gag-gag-gcc-ccc-tgg-ccc-3* (SEQ ID NO: 531) and 1044-038-18 5'-cac-gcg-gcc-gcc- 
cct-gca-ggC7ggc-cag-ggc-cag-3* (SEQ ID NO: 536) to yield a 157 base pair fragment 
The seven PGR products overlap such that PGR amplification in the absence of primers 
yielded the ^propriate 1 005 base pair product to introduce the K70E, G4EA, H29F, 
K57R, S125T, R206L, E210K, R269G, R271K, E290G, S291G, P292E, A294P, L295R, 

30 and A328C mutations. The product was ftirther ampUfied using the outside primers: 
1044-038-1 (SEQ ID NO: 507) and 1044-038-18 (SEQ ID NO: 536) and cloned into 
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pPCR-Script-Amp. From this construct, the nuclease domain was again amplified using 
primers: 1044-038-1 (SEQ ID NO: 507) and 1044-038-18 (SEQ ID NO: 536) and 
digested with EcoRI and Notl. The digested PGR product was ligated into an EcoRI and 
NotI digested TaqEFT-Tth(AKK) construct and transformed into JM109 for enzyme 
5 expression and screaiing. 

K69E Mutation of FEN en^es to further improve substrate specificity 



Construction of Tth(K69E)AKK, Taq(K69E)TthAKK, Tfi(K69E)TthAKK and 
10 Tsc(K69E)TthAKK and mutants 

Site specific mutagenesis was performed on TthAKK, TaqTthAKK, TfiTthAKK 
and TscTthAKK DNAs using the mutagenic primers 5'-atc-gtg-gtc-ttt-gac-gcc-gag-gcc- 
ccc-tcc-ttc-c-3' (SEQ ED NO: 492) and 5'-gga-agg-agg-ggg-cct-cgg-cgt-caa-aga-cca-cga- 
t-3' (SEQ ID NO: 493) to introduce the K69E mutation and to generate Tth(K69E)AKK 
15 (DNA sequence SEQ ID NO: 452; amino acid sequence SEQ ID NO: 494), 

Taq(K69E)ThAKK, (DNA sequence SEQ ID NO: 495; amino acid sequence SEQ ID 
NO: 496), Tfi(K69E)TthAKK (DNA sequence SEQ ID NO: 497; amino acid sequence 
SEQ ID NO: 498)and Tsc(K69E)TthAKK (DNA sequence SEQ ID NO: 499; amino acid 
sequence SEQ ID NO: 500) mutant en2ymes. 

20 

Construction of Tsc(167-334)TthAKK 

Two overlapping PGR ftagments were generated by primers 390-76-08: 5*-tgt- 
gga-att-gtg-agc-gg-3' (SEQ ID NO:314) and 1044-041-01: 5'-ctt-ctc-tca-tcc-gcc-aaa-aca- 
gCC-3' (SEQ ID NO: 451) with template Tth(K69E)AKK, (DNA sequence SEQ ID NO: 

25 452), and primers 1044-041-02: 5'-ctc-cto-cac-gag-ttc-ggc-3' (SEQ ID NO: 453) and 
209-074-02: 5'-gtg-gcc-tcc-ata-tgg-gcc-agg-ac-3' (SEQ ID NO:316) with template 
Tsc(K69E)TaiAKK, (DNA sequence SEQ ID NO: 454). Two products were combined 
and ampUfied with outside primers 390-76-08 and 209-074-02. The recombinant PGR 
product was cut with the restriction enzymes EcoRI and Notl and ligated into the vector 

30 TthAKK which was prepared by cutting with the same enzymes to yield Tsc(167- 
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333)TthAKK construct (DNA sequence SEQ ID NO: 455; amino add sequence SEQ ID 
NO: 456). 

Construction of Tsc(222-334)TthAKK 

5 Two overtyping PGR fragments were generated by primers 390-76-08: 5'-tgt- 

gga-att-gtg-agc-gg-3* (SEQ IDNO:314) and 1044-041-03: 5'-ttc-cag-gtg-ctt-gag-gag-gtt- 
ttc-cag-3' (SEQ ID NO: 457) with template Tth(K69E)AKK (DNA sequence SEQ ID 
NO: 452), and primers 1044-041-04; 5'-ctc-ctc-aag-cac-ctg-gaa-cag-gtg-aaa-3* (SEQ ID 
NO: 458) and 209-074-02: 5'-gtg-gcc-tcc-ata-tgg-gcc-agg-ac-3' (SEQ ID NO:316) with 

10 template Tsc(K69E)TthAKK, (DNA sequence SEQ ID NO: 454). Two products were 
combined and amplified with outside primers 390-76-08 and 209-074-02. The 
recombinant PGR product was cut with the restriction enzymes EcoRI and NotI and 
ligated into the vector TthAKK which was prepared by cutting with the same enzymes to 
yield Tsc(222-334)TthAKK construct (DNA sequence SEQ ID NO: 459; amino acid 

15 sequence SEQ ID NO: 460). 

Construction of Tfi(222-334)TthAKK 

Two overlapping PGR fragments were generated by primers 390-76-08: 5*-tgt- 
gga-att-gtg-agc-gg-3' (SEQ ID NO:314) and 1044-058-05: 5'-gto-cag-gtt-ctt-gag-gag-gtt- 

20 ttc-cag-3* (SEQ ID NO: 461) with template Tth(K69E)AKK (DNA sequence SEQ ID 
NO: 452), and primers 1044-058-06: 5'-ctc-ctc-aag-aac-ctg-gac-cgg-gta-aag-3* (SEQ ID 
NO: 462) and 209-074-02: 5'-gtg-gcc-tcc-ata-tgg-gcc-agg-ac-3' (SEQ ID NO:316) with 
template Tfi(K69E)TthAKK (DNA sequence SEQ ID NO: 497). Two products were 
combined and amplified with outside primers 390-76-08 and 209-074-02. The 

25 recombinant PGR product was cut with &e restriction enzymes EcoRI and NofI and 

ligated into the vector TthAKK which was prepared by cutting with the same enzymes to 
yield Tfi(222-334)TfliAKK construct (DNA sequence SEQ ID NO: 463; amino acid 
sequence SEQ ID NO: 464). 
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Construction of Tfi(167-334)TthAKK 

Two ovCTlapping PGR fragments were generated by primers 390-76-08 5'-tgt- 
gga-att-gtg-agc-gg.3' (SEQ ID NO:314) and 1044-058-07 5'-gac-gtc-ctt-cgg-ggt-gat- 
gag-gtg-gcc-3' (SEQ ID NO: 465) with template TtJi(K69E)AKK, (DNA sequence SEQ 

5 ID NO: 452), and primers 1044-058-08 5'-atc-acc-ccg-aag-gac-gtc-cag-gag-aag-3' (SEQ 
ID NO: 466) and 209-074-02 5'-gtg-gcc-tcc-ata-tgg-gcc-agg-ac-3' (SEQ ID NO:316) 
Willi template Tfi(K69E)TthAKK, (DNA sequence SEQ ED NO: 498). Two products 
were combined and amplified with outside primers 390-76-08 and 209-074-02. The 
recombinant PGR product was cut with the restriction razymes EcoRI and NotI and 

10 ligated into the vector TthAKK which was prepared by cutting with the same enzymes to 
yield Tfi(l67-333)TthAKK construct (DNA sequence SEQ ID NO: 467; amino acid 
sequence SEQ ID NO: 468). 

Constriiction ofTsc(lll-334)TthAKK 

15 Two overiapping PGR fragments were generated by primers 390-76-08 5'-tgt- 

gga-att-gtg-agc-gg-3' (SEQIDNO:3l4)and 1044-058-01 5'-ctc-gag-gcg-ggt-aaa-ccc- 
cag-gag-gtc-3' (SEQ ID NO: 469) with template Tth(K69E)AKK (DNA sequence SEQ 
ID NO: 452), and primers 1044-058-02 5'-ggg-ttt-acc-cgc-ctc-gag-gtg-ccc-ggc-3' (SEQ 
ID NO: 470) and 209-074-02 5*-gtg-gcc-tcc-ata-tgg-gcc-agg-ac-3' (SEQ ID NO:316) 

20 with template Tsc(K69E)TthAKK (DNA sequence SEQ ID NO: 499). Two products 
were combined and ampHfied with outside primers 390-76-08 and 209-074-02. The 
recombinant PGR product was cut with the restriction enzymes EcoRI and NotI and 
ligated into the vector TthAKK wtich was prepared by cutting with the same raizymes to 
yield Tsc(167-333)TthAKK construct pNA sequence SEQ ID NO: 471; amino acid 

25 sequence SEQ ID NO: 472). 

Construction of Tsc(l-167)TthAKK 

Two overlapping PGR fragments were generated by primers 390-76-08 5'-tgt- 
gga-att-gtg-agc-gg-3' (SEQ ID NO:314) and 1044-058-03 5'-aag-cca-ctc-cgg-ggt-gat- 
30 cag-gta-aco-3' (SEQ ID NO: 473) with template Tsc(K69E)TthAKK (DNA sequence 
SEQ ID NO: 499), and primers 1044-058-04 5'-atc-acc-ccg-gag-tgg-ctt-tgg-gag-aag-3' 
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(SEQ ID NO: 474) and 209-^74-02 5'-gtg-gcc-tcc-ata-tgg-gcc-agg-ac-3' (SEQ ID 
NO:316) with template Tth(K69E)AKK (DNA sequence SEQ ID NO: 452). Two 
products were combined and amplified with outside primers 390-76-08 and 209-074-02. 
The recombinant PGR product was cut with the restriction enzymes EcoRI and NotI and 
5 ligated into the vector TthAKK which was prepared by cutting with the same enzymes to 
yield Tsc(l-.167)TthAKK construct (DNA sequence SEQ ID NO: 475; amino acid 
sequence SEQ ID NO: 476). 

Modification of AfuFEN enzymes 

10 

Construction of pAfuFEN 

Plasmid pAfiiFENl was prepared as described [U.S. Patent ^plication Ser. No. 
09/684,938, WO 98/23774, each incorporated herein by reference in their entireties]. 
Briefly, genomic DNA was prepared from one vial (approximately 5 ml of culture) of 

15 live A. Mgidus bacteria from DSMZ (DSMZ #4304) with the DNA XTRAX kit (Gull 
Laboratories, Salt Lake City, UT) according to the manufacturer's protocol. The final 
DNA pellet was resuspended in 100 |il of TE (10 mM Tris HCl, pH 8.0, 1 mM EDTA). 
One microliter of the DNA solution was employed in a PGR using the ADVANTAGE 
cDNA PGR kit (Glonetech); the PGR was conducted according to manufacturer's 

20 recommendations. 

The 5' end primer is complementary to the 5' end of the Afu FEN-1 gene except it 
has a 1 base pair substitution to create an Nco I site. The 3* end primer is complemetary 
to the 3' end of the Afu FEN-1 gene downstream from the FBN-1 ORF except it contains 
a 2 base substitution to create a Sal I site. The sequences of the 5* and 3' end primers are 

25 5'-CCGTCAAGATTTAGGATGGGTGCGGA-3' (SEQ ID NO:617) and 

5'-GCGGCAGCTCGTAGTGGACATGCTTTTCGTG (SEQ ID NO:618), respectively. 
Cloning of the resulting fragment was as described for the PfiiFENl gene, U.S. Patent 
application Ser. No. 5,994,069, incorporated herein in its entirety for all purposes, to 
create the plasmid pTrc99-AFFENl . For expression, the pTrcAftiHis plasmid was 

30 constructed by modifying pTrc99-AFFENl , by adding a histidine tail to facilitate 
purification. To add this histidine tail, standard PGR primer-directed mutagenesis 
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10 



15 



methods were used to insert the coding sequence for six histidine residues between the 
last amino acid codon of the pTrc99-AFFENl codmg region and the stop codon. The 
resulting plasmid was termed pTrcAfoHis. The protein was then expressed as described 
and purified by binding to a Ni-H- affinity column. 

Construction of Afu(Y236A), A46-5 

Two overlapping PGR fi-agments were generated from template AfiiFEN (SEQ ID 
NO:556) with primers: 785-73-04 5'-ctg-gtc-ggg-acg-gac-gcc-aat-gag-ggt-gtg-aag-3' 
(SEQ ID NO: 547) and 700-10-03 5'-ctt-ctc-tca-tcc-gcc-aaa-aca-gcc-3' (SEQ ID 
NO:343). and primers: 390-076-08 5'-tgt-gga-att-gtg-agc-gg-3' (SEQ D) NO:314) and 
785-73-06 5'-gtc-cgt-ccc-gac-cag-aat-3' (SEQ ID NO: 548). The two products were 
combined and amplified with outside primers 700-10-03 and 390-076-08. The 
recombinant PGR product was cut with the restriction enzymes Ncol and SaH and Ugated 
into the AfuFEN construct which was prepared by cutting with the same enzymes to yield 
the Aiu(Y236A) constmct (DNA sequence SEQ ID NO: 549; amino acid sequence SEQ 
ID NO: 550). 



Constroction of Afu(Y236R), A56-9 

Two overlapping PGR Augments were generated using the template AfuFEN with 

20 primers: 785-73-05 5'-ctg-gtc-ggg-acg-gac-agg-aat-gag-ggt-gtg-aag-3' (SEQ ID NO: 
551) and 700-10-03 5'-ctt-ctc-tca-tcc-gcc-aaa-aca-gcc-3' (SEQ ID NO:343), and primers: 
390-076-08 5'-tgt-gga-att-gtg-agc-gg-3' (SEQ E) NO:314) and 785-73-06 5'-gtc-cgt-ccc- 
gac-cag-aat-3' (SEQ ID NO: 548). The two products were combined and amplified with 
outside primers 700-10-03 and 390-076-08. The recombinant PGR product was cut with 

25 the restriction enzymes Ncol and Sail and ligated into the AfuFEN construct which was 
prepared by cutting with the same enzymes to yield Afu(Y236R) construct (DNA 
sequence SEQ ID NO: 552; amino acid sequence SEQ ID NO: 553). 
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2. Chimeras of FEN enzymes and Thermus Polymerase derivatives 
The following enzyme constructs combine portions of the AfuFEN enzyme 
polymerase domain and the polymerase domain of Thermus ploymerases. These 
combinations were designed based on information generated by molecular modeling. 

5 

Construction of Afti336-Tth296(AKK), ATl-1 

Two overlapping PGR fragments were generated. The first fragment was made 
using the AfuFEN construct (SEQ ID NO:556) as template with the primers: 390-076-08 
5'-tgt-gga-att-gtg-agc-gg-3' (SEQ ID NO:3I4) and 390-065-05 5'-gaa-cca-cct-ctc-aag- 

10 cgt-gg-3* (SEQ ID NO: 554). The second fragment was made using TthAKK (SEQ ID 
NO:558) as template and the primers: 700-049-01 5'-acg-ctt-gag-agg-tgg-ttc-ctg-gag- 
gag-gcc.ccc-tgg-3' (SEQ ID NO: 555) and 390-076-09 5*-taa-tct-gta-tca-ggc-tg-3* (SEQ 
ID NO:557). The two products contain a region of sequence overlap, and were combined 
and amplified with outside primers 390-076-08 and 390-076-09 in a recombinant PGR 

1 5 reaction. The recombinant PGR product was cut with the restriction enzymes BspEI and 
Sail and ligated into the AfuFEN construct which was prepared by cutting with the same 
OTzymes to yield Afu336-Tth296(AKK) construct (DNA sequence SEQ ID NO: 559; 
amino acid sequence SEQ ID NO: 560). 
Construction of Afii328-Tth296(AKK), AT2-3 

20 Two overlapping PGR fragments were generated, the first by primers: 390-076-08 

5*-tgt-gga-att-gtg-agc-gg-3' (SEQ ID NO:314) and 700-049-02 5*-ggt-tga-ctt-cag-agc-ttt- 
gag-3* (SEQ ID NO: 561) with template AftiFEN (DNA sequence SEQ ID NO:556), and 
the second by primers: 700-049-03 5'-aaa-gct-ctg.aag-tca-acc-ctg-gag-gag-gcc-ccc-tgg-3' 
(SEQ ID NO: 562) and 390-076-09 5'-taa-tct-gta-tca-ggc.tg-3* (SEQ ID NO:557) with 

25 template TthAKK (DNA sequaice SEQ ID NO:558). The two products were combined 
and amplified with outside primers 390-076-08 and 390-076-09. The recombinant PGR 
product was cut with the restriction enzymes BspEI and Sail and ligated into the vector 
pAfiiFEN which was prepared by cuttmg with the same enzymes to yield Afu336- 
Tth296(AKK) construct (DNA sequence SEQ ID NO: 563; amino acid sequence SEQ ID 

30 NO: 564). 
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Construction of Afii336-Taq5M 

The TaqSM construct (SEQ ID N0:41) was cut with the enzymes Nott and SaH 
and the smaller insert fragment was gel isolated. The Afu336-Tth296(AKK) construct 
(DNA sequence SEQ ID NO: 559) was also cut with the same restriction enzymes and 
the larger vector fragment was purified. The insert (TaqSM polymerase domain) was 
then ligated into the vector to generate the Aiu336-Taq5M construct (DNA sequence 
SEQ ID NO: 565; amino acid sequence SEQ ID NO: 566). 

Constmction of Afu336-TaqDN 

The TaqDNHT construct was cut with the enzymes Nott and SaU and the smaUer 
insert fragment was gel isolated. The Afu336-Tth296(AKK) construct pNA sequence 
SEQ ID NO: 559) was also cut with the same restriction enzymes and the larger vector 
fragment was purified. The insert (TaqDN polymerase domain) was then Ugated into the 
vector to generate the Afii336-Taq5M construct (DNA sequence SEQ ID NO: 567; amino 
add sequKice SEQ ID NO: 568). 

Random Chimerization of Thermus Polymerases 

Numerous enzymes widi altoed fimctions were generated via random 
chimerization of Thermus polymerases based on the principal of DNA shuffling (Volkov 
and Arnold. Methods in Enzymology 328:456 [2000]). The procedure below was used to 
develq) the fr>llowing chuneras. 

The genes of interest for random chimerization were PCR amplified with primers: 
390-076-08 5'.tgt-gga-att-gtg-agc-gg-3' (SEQ ID NO:314) and 209-074-02: 5'-gtg-gcc- 
tcc-ata-tgg-gcc-agg-ac-3' (SEQ ID NO:316) to generate the approximately 1.4 kbp 
templates. About 2 ng of the DNA templates were mixed in equal proportion, and then 
digested with DNase I (0.33 U) in a 30 jil reaction at 15° C for approximately 1 minute to 
generate fragments 50-200 bp in size. DNA firagments were purified in a 4 % agarose gel 
and extracted by QIAEXH gel extraction kit (QIAGEN) according to manufacturer's 
instructions. The purified Augments (10 ^1) were added to 10 nl of 2XPCRpre-mix (5- 
fold diluted, cloned Pfii buffer, 0.4 mM each dNTP. 0.06 U/jil cloned Pfii DNA 
polymerase, STRATAGBNE Cat.# 600153, with accompanying buffer) for the fragment 
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reassembly reaction (PCR program: 3 min 94 followed by 40 cycles of 30 sec 94 **C, 1 
min 52 **C, 2 min + 5 s/cycle 72 followed by 4 min at 72 °C). The reassembled . 
products (1 III of a 10-fold dilution) were then PCR amplified with a pair of nested 
primers: 072-090-01 5'-gag-cgg-ata-aca-att-tca.cac-agg-3' (SEQ IDNO: 569) and 189- 

5 082-01 5'-tgc-ccg-gtg-cac-gcg-gcc-gcc-cct-gca-ggc-3' (SEQ ID NO: 570) using the 
CLONTECH GC melt cDNA PCR kit (Cat.# K1907-Y) according to the manufacture's 
instructions. The purified PCR products were digested with restriction enzymes EcoRI 
and NotI and then ligated into the TthAKK construct that was prepared by cutting with 
the same enzymes. The ligation mixture was transformed into JM109 competent cells 

1 0 and colonies were screened for enzyme activity. 

1. Generation of Random Chimeras S26 and S36 

The templates used to develop these chimeric enzymes were the nuclease domains 
15 &om TthAKK, TaqTfliAKK, TscTthAKK and the TfiTthAKK constracts. Two clones 
were found to show improvement of activity based on primary activity screening. 
Random chimeras S26 (DNA sequoice SEQ ID NO: 571; amino acid sequence SEQ ID 
NO: 572) and S36 (DNA sequence SEQ ID NO: 573; amino acid sequence SEQ ID NO: 
574) were then sequenced and isolated. 

20 

2. Introduction of L107F/E108T, LlOPFmiOT and K69E mutations in S26 
and 836 to improve substrate specificity 

Construction of S26(FT) 

Site specific mutagenesis was performed on pS26 DNA using the mutagenic 
25 primers: 785-008-03 5'-ttt-acc-cgc-ctc-gag-gtg-ccg-ggc-3' (SEQ ID NO: 489) and 680- 
21-03 5*-cgg-cac-ctc-gag-gcg-ggt-aaa-gcc-caa-aag-gtc-cac-3* (SEQ ID NO: 490) to 
introduce the L107F/E108T mutations and g^erate S26(FT) (DNA sequence SEQ ID 
NO: 575; amino acid sequence SEQ ID NO: 576). 
Construction of S36(FT) 
30 Site specific mutagenesis was performed on pS36 DNA using the mutagenic 

primers: 785-096-01: 5'-gtg-gac-ctt-ctg-ggc-ttt-acc-cgc-ctc-gag-gcc-ccg-3' (SEQ ID NO: 
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486) and 785-096-02: 5'-cgg-ggc-ctc-gag-gcg-ggt-aaa-gcc-cag-aag-gtc-cac-3' (SEQ ID 

NO: 487) to introduce the L109FA^110T mutations and generate S36(FT) (DNA 

sequence SEQ ID NO: 577; amino acid sequaice SEQ ID NO: 578). 

Construction of S26(K69E) 
5 Site specific mutagenesis was performed on S26 DNA using the mutagenic 

primers: 5'-atc-gtg-gtc-ttt-gac-gcc-gag-gcc-cco-tcc-ttc-c-3' (SEQ ID NO: 492) and 5'- 

gga-agg-agg-ggg-cct-cgg-cgt-caa-aga-cca.cga-t-3' (SEQ ID NO: 493) to introduce the 

K69E mutation and to generate S26(K69E) (DNA sequence SEQ ID NO: 579; amino 

acid sequence SEQ ID NO: 580). 

10 Construction of S26(FT/K69£) 

Site specific mutagenesis was performed on S26(FT) DNAs using the mutagerac 

primers: 5'-atcgtg-gtc-ttt-gac-gcc-gag-gcc-ccc-tcc-ttc-c-3' (SEQ ID NO: 492) and 5'- 

gga-agg-agg-ggg^ct^gg-cgt-caa-aga-cca<ga-t.3' (SEQ ID NO: 493) to introduce the 

K69E mutation and to generate S26(FT/K69E) (DNA sequence SEQ ID NO: 581; amino 

15 acid sequence SEQ ID NO: 582). 

3. More random chimeras, N3D7, N1A12 N1C4, and N2C3 
The templates used to generate these chimerics were the nuclease domains of 
Tth(K69E)AKK, Taq(K69E)TthAKK:, Tsc(K69E)TthAKK and Tfi(K69E)TthAKK 
20 constructs. Four clones were shown to have improved activity based on the primary 
activity screening. Random chimeras N3D7 (DNA sequence SEQ ID NO: 583; amino 
add sequence SEQ ID NO: 584), NIA12 (DNA sequence SEQ ID NO: 585; amino acid 
sequence SEQ ID NO: 586), N1C4 (DNA sequence SEQ ID NO: 587; amino acid 
sequence SEQ ID NO: 588) and N2C3 (DNA sequence SEQ ID NO: 589; amino acid 
25 sequence SEQ ID NO:590) were them sequenced and isolated respectively. 

EXAMPLE 10 
Test For The Dependence Of An Enzyme On 
The Presence Of An Upstream Oligonucleotide 

30 When choosing a structure-specific nuclease for use in a sequential invasive 

cleavage reaction it is preferable that the enzyme have Uttie ability to cleave a probe 1) in 
the absence of an upstream oUgonucleotide, and 2) in the absence of overlap between the 
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upstream oligonucleotide and the downstream probe oligonucleotide^ Figs. 26a-e depicts 
several structures that can be used to examine the activity of an enzyme is confronted 
with each of these types of structures. The structure a (Fig. 26a) shows the alignment of 
a probe oligonucleotide with a target site on bacteriophage Ml 3 DNA (Ml 3 sequences 

5 shown in Fig. 26 are provided in SEQ ID NO:217) in the absence of an upstream 

oligonucleotide. Structure b (Fig. 26b) is provided with an upstream oUgonucleotide that 
does not contain a region of overlap with the labeled probe (the label is indicated by the 
star). In structures c, d and e (Figs. 26c-e) the upstream oUgonucleotides have overlaps 
of 1, 3 or 5 nucleotides, respectively, with the downstream probe oligonucleotide and 

10 each of these structures represents a suitable invasive cleavage structure. The enzyme 
Pju FEN-1 was tested for activity on each of these structures and all reactions were 
performed in duplicate. 

Each reaction comprised 1 nM 5' TET labeled probe oligonucleotide 89-15-1 
(SEQ ID NO:212), 50 nM upstream oKgonucleotide (either oligo 81-69-2 [SEQ ID 

15 NO:216], oligo 81-69-3 [SEQ ID NO:215], oligo 81-69-4 [SEQ ID NO:214], oUgo 

81-69-5 [SEQ ID NO:213], or no upstream oligonucleotide), 1 finol M13 target DNA, 10 
mg/ml mik and 10 ng of P^w FEN-1 ui 10 jil of 10 mM MOPS (pH 7.5), 7.5 mM MgCh 
with 0.05% each of Tween 20 and Nonidet P-40. 

All of the components except the enzyme and the MgCh were assembled in a 

20 final volume of 8 ^1 and were overlaid with 10 jil of CHILLOUT liquid wax. The 

samples were heated to the reaction temperature of 69*'C. The reactions were started by 
the addition of the Pfu FEN-1 and MgCh, in a 2 ^il volume. After incubation at 69**C for 
30 minutes, ttie reactions were stopped with 10 |il of 95% formamide, 10 mM BDTA, 
0.02% methyl violet. Sample were heated to 90°C for 1 min immediately before 

25 electrophoresis through a 20% denaturing acrylamide gel (19:1 cross-linked), with 7 M 
urea, in a buffer of 45 mM Tris-Borate, pH 8.3, 1.4 mM EDTA. Gels were then analyzed 
with a FMBIO-100 Hitachi FMBIO fluorescence imager. The resulting image is 
displayed in Fig. 27. 

In Fig. 27, lanes labeled "a" contam the products generated from reactions 

30 conducted without an upstream oligonucleotide (structure a), lanes labeled "b" contain an 
upstream oligonucleotide that does not invade the probe/target duplex (structure b). 
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Lanes labeled "c", "d" and "e" contain tiie products generated from reactions conducted 
using an upstream oUgonucleotide that invades the probe/target duplex by 1. 3 or 5 bases, 
respectively. The size (in nucleotides) of the uncleaved probe and the cleavage products 
is indicated to the left of the image in Fig. 27. Cleavage of the probe was not detectable 
vsrhen structures a and b were utilized. In contrast, cleavage products were generated 
when invasive cleavage structures were utilized (structures c-e). These data show that the 
Pju FEN-1 raizyme requires an overl^ping upstream oligonucleotide for specific 

cleavage of the probe. 

Any enzyme maybe examined for its suitability for use in a sequential invasive 
cleavage reaction by examining the ability of the test enzyme to cleave structures a-e (it 
is understood by those in the art that the specific oligonucleotide sequences shown m 
Figs. 26a-€ need not be employed in the test reactions; these structures are merely 
iUustrative of suitable test structures). Desirable enzymes display Uttte or no cleavage of 
structures a and b and display specific cleavage of structures c-e (i.e., they generate 
cleavage products of the size expected fix)m the degree of overlap between the two 
oligonucleotides employed to form the invasive cleav^e structure). 

EXAMPLE 11 

Use Of The Products Of A First Invasive Cleavage Reaction To Enable A Second 
Invasive Cleavage Reaction With A Net Gain In Sensitivity 
As discussed in the Description of The Invention above, the detection sensitivity 
of the mvasive cleavage reaction can be increased by the performing a second round of 
invasive cleavage using the products of the first reaction to complete the cleavage 
structure in the second reaction. In this Example, the use of a probe that, when cleaved in 
a first invasive cleavage reaction, forms an integrated INVADER oligonucleotide and 
ta^et molecule for use in a second invasive cleavage reaction, is illustrated. 

A first probe was designed to contain some internal complementarity so that when 
cleaved in a first invasive cleavage reaction the product ("Cut Probe 1 ") could form a 
target strand comprising an integral INVADER oligonucleotide. A second probe was 
provided m the reaction that would be cleaved at the intended site when hybridized to the 
newly foraied target/INVADER molecule. To demonstrate the gain in signal due to the 
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perfonnance of sequential invaave cleavages, a standard invasive cleavage assay, as 
described above, was performed in parallel. 

All reactions were perfonned in duplicate. Each standard non-sequential) 
invasive cleavage reaction comprised 1 \iM 5' fluorescein-labeled probe oligo 073-182 (5' 

5 Fl-AGAAAGGAAGGGAAGAAAGCGAA-3*; SEQ ID NO:591), 10 nM upstream oligo 
81-69-4 (5'-CTTGACGGGGAAAGCCGGCGAACGTGGCGA-3'; SEQ ID NO:214), 10 
to 100 attomoles of M13 target DNA, 10 mg/ml tRNA and 10 ng of FEN-1 in 10 [il 
of 10 mM MOPS (pH 7.5), 8 mM MgCh with 0.05% each of Tween 20 and Nonidet 
P-40. All of the componrats except Qie enzyme and the MgCl2 were assembled in a 

10 volume of 7 \il and were overlaid with 10 ^il of CHILLOUT liquid wax. The samples 
were heated to the reaction temperature of 62^C. The reactions were started by the 
addition of the PJu FEN-1 and MgCfe, in a 2 ^il volume. After incubation at 62'C for 30 
minutes, the reactions were stopped with 10 |il of 95% formamide, 10 mM EDTA, 0.02% 
mefliyl violet 

15 Each sequential invasive cleavage reaction comprised 1 \jM 5' fluorescein-labeled 

ohgonucleotide 073-191 (the first probe or "Probe 1", 5' Fl-TGGAGGTCAAAACATCG 
ATAAGTCGAAGAAAGGAAGGGAAGAAAT-3'; SEQ ID NO:592), 10 nM upstream 
ohgonucleotide 81-69-4 (5«- CTTGACGGGGAAA GCCGGCGAACGTGGCGA-3'; 
SEQ ID NO:214), 1 |xM of 5* fluorescein labeled oligonucleotide 106-32 (the second 

20 probe or "Probe 2", 5* Fl-TGTTTTGACCT CCA-3»; SEQ ID NO:593), 1 to 100 amol of 
M13 target DNA, 10 mg/ml tRNA and 10 ng of PJu FEN-1 in 10 \i\ of 10 mM MOPS 
(pH 7,5), 8 mM MgCh with 0.05% each of Tween 20 and Nonidet P-40. All of the 
components except the enzyme and the MgCla were assembled in a volume of 8 |al and 
were overlaid with 10 p.1 of CHILLOUT liquid wax. The samples were heated to the 

25 reaction temperature of 62**C (this temperature is the optimum temperature for annealing 
of Probe 1 to the first target). The reactions were started by the addition of FEN-1 
and MgCh, m a 2 jil volume. After incubation at 62*^0 for 15 minutes, the temperature 
was lowered to 58**C (this temperature is the optimum temperature for annealing of Probe 
2 to the second target) and the samples were incubated for another 15 min. Reactions 
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were stopped by the addition of 10 ^1 of 95% fonnanude, 20 mM EDTA, 0.02% methyl 
violet 

Samples ftom both the standard and the sequential invasive cleavage reactions 
were heated to 90° C for 1 min immediately before electrophoresis through a 20% 
denaturing acrylamide gel (19:1 cross-linked), with 7 M urea, in a buffer of 45 mM 
Tris-Borate, pH 8.3, 1 .4 mM EDTA. The gel was then analyzed with a Molecular 
Dynamics Fluorlmager 595. The resulting image is displayed in Fig. 28a. A graph 
showing measure of fluorescence intensity for each of the product bands is shown in Fig. 
28b. 

In Fig. 28a, lanes 1-5 contain the products generated in standard invasive cleavage 
reactions that contained eitiier no target (lane 1). 10 amol of target Oanes 2 and 3) or with 
100 amol of target Oanes 4 and 5). The uncleaved probe is seen as a dark band in each 
lane about halfway down the panel and the cleavage products appear as a smaller black 
band near the bottom of the panel, the position of the cleavage product is indicated by an 
anow head to the left of Fig. 28a. The gray ladder of bands seen in lanes 1-5 is due to the 
thermal degradation of the probe and is not related to flie presence or absence of the 
target DNA. The remaining lanes display products generated in sequential invasive 
cleavage reactions tiiat contained 1 amol of target (lanes 6 and 7), 10 amol of target 
(lanes 8 and 9) and 100 amol of target (lanes 10 and 1 1). The uncleaved first probe 
(Probe 1; labeled "1 uncut") is seen near the top of tiie panel, while the cleaved first probe 
is indicated as "1: cut". Similarly, tiie uncleaved and cleaved second probe are indicated 
as "2: uncut" and 2: cut," req)ectively. 

The graph shown m Fig. 28b compares ttie amount of product generated fi»m tiie 
standard reaction ("Series 1") to the amount of product generated from the second step of 
tiie sequential reaction ("Series 2"). The level of background fluorescence measured 
ftom a reaction tiiat lacked target DNA was subtracted from each measurement. It can be 
seen from the table located below flie graph fliat ttie signal from tiie standard invasive 
cleavage assay tiiat contained 100 attomoles of target DNA was nearly identical to the 
signal fiom flie sequential invasive cleavage assay in which 1 attomole of target was 
present, indicating tiiat flie inchision of a second cleavage structure increases tiie 
sensitivity of tiie assay 100 to 200-fold. This boost in signal allows easy detection of 
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target nucleic acids at the sub attomole level using the sequential invasive cleavage assay, 
while the standard assay, when performed using this enzyme for only 30 minutes, does 
not generate detectable product in the presence of 10 attomoles of target. 

When the amount of target was decreased by 10 or 100 fold in the sequential 
5 invasive cleavage assay, the intensity of the signal was decreased by the same proportion. 
This indicates that the quantitative capability of the invasive cleavage assay is retained 
even when reactions are performed in series, thus providing a nucleic acid detection 
method that is both sensitive and quantitative. 

While in this Example, the two probes used had different optimal hybridization 
10 temperatures (Le., the temperature empiracally determined to give the greatest turnover 
rate in the given reaction conditions), the probes may also be selected (i.e., designed) to 
have the same optimal hybridization temperature so that a temperature shift during 
incubation is not necessary. 

15 EXAMPLE 12 

The Products Of A Completed Sequential Invasive Cleavage Reaction Cannot Cross 
Contaminate Subsequent Similar Reactions 
As discussed in the Description of the Invention, the serial nature of the multiple 
invasive cleavage events that occur in the sequential invasive cleavage reaction, in 

20 contrast to the reciprocating nature of the polymerase chain reaction and similar doubling 
assays, means that the sequential invasive cleavage reaction is not subject to 
contamination by the products of like reactions because the products of the first cleavage 
reaction do not participate in &e generation of new signal in the second cleavage 
reaction. If a large amoimt of a completed reaction were to be added to a newly 

25 assembled reaction, the background that would be produced would come from the 

amount of target that was also carried in, combined with the amoimt of akeady-cleaved 
probe that was carried in. In this Example, it is demonstrated that a very large portion of 
a primary reaction must be introduced into the secondary reaction to create significant 
signal. 

30 A first or primary sequential invasive cleavage reaction was performed as 

described above using 100 amol of target DNA. A second set of 5 reactions were 
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assembled as described in Ex. U with the exception that portions of the first reaction 
were introduced and no additional target DNA was included. These secondary reactions 
were initiated and incubated as described above, and included 0, 0.01, 0.1, 1, or 10% of 
the fiistreaction material. A control reaction including 100 amol of target was included 
in the second set also. The reactions were stopped, resolved by electrophoresis and 
visualized as described above, and the resulting image is displayed in Fig. 29. The 
primaiy probe, uncut second probe and the cut 2nd probe are indicated on the left as "1: 
cut", 2: uncut" and 2: cut", respectively. 

In Fig. 29, lane 1 shows the results of the first reaction with the accumulated 
product at flie bottom of the panel, and lane 2 show a 1 :10 dilution of the same reaction, 
to demonstrate the level of signal that could be expected from that level of contamination, 
without fiirtha ampUfication. Lanes 3 through 7 show the results of the secondary 
cleavage reactions that contained 0, 10, 1,0.1 or 0.01% of the first reaction material 
added as contaminant, respectively and lane 8 shows a control reaction that had 100 amol 
of target DNA added to verify the activity of the system in the secondary reaction. The 
signal level in lane 4 is as would be expected when 10% of the pre-cleaved material is 
transferred (as in lane 2) and 10% of the transferred target material firom the lane 1 
reaction is allowed to further amplify. At all levels of further dilution the signal is not 
readily distinguished fiom background. These data demonstrate that while a large-scale 
transfer fiom one reaction to another may be detectable, cross contamination by the 
minute quantities that would be expected from aerosol or from equipment contamination 
would not be easify mistaken for a felse positive result. These data also demonstrate that 
when the products of one reaction are deUberately carried over into a fresh sample, these 
products do not participate in the new reaction, and thus do not affect the level of target- 
dep«ttdent signal that may be gaierated in that reaction. 
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EXAMPLE 13 

Detection of Human Cytomegalovirus Viral DNA by Invasive Cleavage 
The previous Example demonstrates the abiUty of the invasive cleavage reaction 

5 to detect minute quantities of viral DNA in the presence of human genomic DNA. In this 
Example, the probe and INVADER oUgonucleotides were designed to target the 
3104-3061 region of the major immediate early gene of human cytomegalovirus 
(HCMV) as shown in Pig. 30. In Fig. 30, the INVADER oligo (89-44; SEQ ID NO:219) 
and the fluorescein (Fl>labeled probe oUgo (89-76; SEQ ID NO:218) are shown annealed 

10 along a region of the HCMV genome corresponding to nucleotides 3057-31 10 of the viral 
DNA (SEQ ID NO:220), The probe used in this Example is a poly-pyrimidine probe and 
as shown herein ttie use of a poly-pyrimidine probe reduces background signal generated 
by the thermal breakage of probe oligos. 

The genomic viral DNA was purchased from Advanced Biotechnologies, 

15 Incorporated (Columbia, MD). The DNA was estimated (but not certified) by personnel 
at Advanced Biotechnologies to be at a concentration of 170 amol (1 x 10^ copies) per 
microliter. The reactions were performed in quadrupUcate. Each reaction comprised 1 
\iM 5' fluorespein labeled probe oligonucleotide 89-76 (SEQ ID NO:218), 100 nM 
INVADER oligonucleotide 89-44 (SEQ ID NO:219), 1 ng/ml human genomic DNA, and 

20 one of five concentrations of target HCMV DNA in the amounts indicated above each 
lane in Fig, 31, and 10 ng of PJu FEN-1 in 10 jil of 10 mM MOPS (pH 7.5), 6 raM MgCh 
with 0.05% each of Tween 20 and Nonidet P-40. All of the components except the 
labeled probe, enzyme and MgCb were assembled in a final volume of 7 ^1 and were 
overlaid with 10 ^il of CHILLOUT liquid wax. The samples were heated to 95**C for 5 

25 min, then reduced to 62°C. The reactions were started by the addition of probe, PJU 
FEN-1 and MgCU, in a 3 )il volume. After incubation at 62**C for 60 minutes, the 
reactions were stopped with 10 ^1 of 95% foimamide, 10 mM EDTA, 0,02% methyl 
violet. Samples were heated to 90° C for 1 min immediately before electrophoresis 
through a 20% acrylamide gel (19:1 cross-linked), with 7 M urea, in a buffer of 45 mM 

30 Tris-Borate, pH 83, 1 A mM EDTA. Gels were then analyzed with a Molecular 
Dynamics Fluorlmager 595. 
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The resulting image is displayed in Fig. 31. The replicate reactions were run in 
groups of four lanes with the target HCMV DNA content of the reactions indicated above 
each set of lanes (0-170 amol). The uncleaved probe is seen in the upper third of the 
panel C'Uncut 89-76") while the cleavage products are seen in the lower two-thirds of the 
panel ("Cut 89-76"). It can be seen that the intensity of the accumulated cleavage product 
is proportional to the amount of the target DNA in the reaction. Furthermore, it can be 
clearly seen in reactions that did not contain target DNA ("no target") that the probe is 
not cleaved, even in a background of human genomic DNA. While 10 ng of human 
genomic DNA was included in each of the reactions shownmFig. 31, inclusion of 
genomic DNA up to 200 ng has sUght impact on the amount of product accumulated. 
The data did not suggest that 200 ng per 10 ^il of reaction mixture represented the 
maximum amount of genomic DNA that could be tolerated without a significant 
reduction in signal accumulation. For reference, this amomit of DNA exceeds what 
might be found in 0.2 ml of urine (a commonly tested amount for HCMV in neonates) 
and is equivalent to the amount that would be found in about 5 ^1 of whole blood. 

These results demonstrate that the standard (i.e., non-sequential) invasive 
cleavage reaction is a sensitive, specific and reproducible means of detecting viral DNA. 
Detection of 1.7 amol of target is roughly equivalent to detection of 10* copies of the 
virus. This is equivalent to the number of viral genomes that might be found in 0.2 mis 
of uiine from a congenitally infected neonate (10^ to \0' genome equivalents per 0.2 mis; 
Stagno et d,, J. Infect. Dis., 132:568 [1975]). Use of the sequential invasive cleavage 
assay would permit detection of even fewer viral DNA molecules, facilitating detection 
in blood (lO' to 10^ viral particles per ml; Pector et al, J. Clin. Microbiol.. 30:2359 
[1992]), vMsAi carries a much larger amount of heterologous DNA. 

From the above it is clear that the invention provides reagents and methods to 
permit the detection and characterization of nucleic acid sequences and variations in 
nucleic acid sequences. The INVADER-directed cleavage reaction and the sequential 
INVADER-directed cleavage reaction of the present invention provide ideal direct 
detection methods that combine the advantages of the direct detection assays {e.g., easy 
quantification and minimal risk of carry-over contamination) with the specificity 
provided by a dual or tri oligonucleotide hybridization assay; 
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As indicated in the Description of the Invention, the use of sequential invasive 
cleavage reactions can present the problem of residual uncut first, or primary, probe 
interacting with the secondary target, and either competing with the cut probe for 
binding, or creating background through low level cleavage of the resulting structure. 
5 This is shown di^ramatically in Figs. 32 and 33. In Fig. 32, the reaction dq)icted makes 
use of the cleavage product from the first cleavage structure to form an INVADER 
oligonucleotide for a second cleavage reaction. The structure formed between the 
secondary target, the secondary probe and the uncut primary probe is depicted in Fig. 32, 
as the right hand structure shown in step 2a. This structure is recognized and cleaved by 

10 the 5' nucleases, albeit very inefiSciently {Lc, at less than about 1% in most reaction 
conditions). Nonetheless, the resulting product is indistinguishable fi-om the specific 
product, and thus may lead to a false positive result. The same effect can occur when the 
cleaved primary probe creates and integrated INVADER/target (IT) molecule, as 
described in Example 1 1; the formation of the undesirable complex is depicted 

15 schematically in Fig. 33, as the right hand structure shown in step 2a. 

The improvements provided by the inclusion of ARRESTOR oligonucleotides of 
various compositions in each of these types of sequential INVADER assays are 
demonstrated in the following Examples. These ARRESTOR oligonucleotides are 
configured to bind the residual uncut probe from the first cleavage reaction in the series, 

20 thereby increasing tiie e£Gcacy of and reducing the non-specific background in the 
subsequent reaction(s). 



EXAMPLE 14 

"ARRESTOR" Oligonacleotides Improve Sensitivity of Multiple Sequential 
25 Invasive Cleavage Assays 

bi this Example, the effect of including an ARRESTOR oligonucleotide on the 
generation of signal using the IT probe system 33 is demonstrated. The ARRESTOR 
oligonucleotide hybridizes to the primary probe, mainly in the portion that recognizes the 
target nucleic acid during the first cleavage reaction. In addition to examining the effects 
30 of adding an ARRESTOR oligonucleotide, the effects of using ARRESTOR 

oligonucleotides that extended in complementarity different distances into the region of 
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tbe primary probe that composes tbe secondaiy IT structure were also investigated. 
Hiese effects were compared in reactions that included the target DNA over a range of 
concentrations, or that lacked target DNA, in order to demonstrate the level of 
nonspecific not related to target nucleic acid) background in each set of reaction 
S conditions. 

The target DNA for these reactions was a fragment that comprised the full length 
of tiie hepatitis B genome fiom strain of serotype adw. This material was created using 
tiie polymerase chain reaction from plasmid pAM6 (ATCC #45020D). The PCRs were 
conducted using a vector-based forward primer, oligo # 156-022-001 (5'- 
10 ggcgaccacacccgtcctgt-3'; SEQ ID NO:594) and a reverse primer, oUgo #156-022-02 (5'- 
ccacgatgcgtccggcgtag-3'; SEQ ID NO:595) to amplify the full lengtti of the HBV insert, 
an amplicon of about 3.2kb. The cycling conditions included a denahiration of the 
plasmid at 95»C for 5 minutes, foUowed by 30 cycles of 95»C, 30 seconds; 60°C. 40 
seconds; and IT'C, 4 minutes. This was foUowed by a final extension at irC for 10 
15 minutes. The resulting amplicon. termed pAM6#2, was adjusted to 2 MNH4OAC, and 
collected by precipitation wiht isopropanol. After drying in vacuo, DNA was dissolved 
in lOmMTrispH .0, 0.1 mMEDTA. The concentration was determined by OD200 
measurement, and by INVADER assay witii comparison to a standard of known 
concentration. 

The INVADER reactions w«e conducted as follows. Five master mixes, termed 
"A," "B," "C." "D." arid "E." were assembled; all mixes contained 12.5 mM MOPS, pH 
7.5, 500 finoles primary INVADER oUgo #218-55-05 (SEQ ID NO:596). 10 ng human 
genomic DNA (Novagen) and 30 ng AfiiFENl enzyme, for every 8 jil of mix. Mix A 
contained no added HBV genomic amplicon DNA; mix B contained 600 molecules of 
25 HBV genomic amplicon DNA pAM6 #2; mix C contained 6,000 molecules pAM6 #2; 
mix D contained 60,000 molecules pAM6 #2; and mix E contained 600,000 molecules 
pAM6 #2. The mixes were aUquotted to the reaction tubes, 8 jil/tube: mix A to tubes 1, 
2, 1 1, 12, 21 and 22; mix B to tiibes 3, 4, 13, 14, 23 and 24; mix C to tubes 5, 6, 15, 16, 
25 and 26; mix D to tubes 7, 8, 17, 18, 27 and 28; and mix E to tubes 9, 10, 19. 20, 29 
and 30. The samples were incubated at 95°C for 4 minutes to denature the HBV genomic 
amplicon DNA. The reactions were Hosa cooled to 67X, and 2 of a mix containing 
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37.5 mM MgCl2 and 2.5 pmoles 218-95-06 (SEQ ID NO:597) for every 2 jil, was added 
to each sample. The samples were incubated at 67*^C for 60 minutes. Three secondary 
reaction master mixes were prepared, all mixes contained 1 0 pmoles of secondary probe 
oligonucleotide #228-48-04 (SEQ ID NO:598) for every 2 jil of mix. Mix 2A contained 
5 no additional oBgonucleotide, mix 2B contained 5pmoles "ARRESTOR" oligo # 21 8-95- 
03 (SEQ ID NO:599) and mix 2C contained 5 pmoles of "ARRESTOR" oligo # 218-95- 
01 (SEQ ID NO:600). After the 60 minute incubation at 6TC (the primary reaction 
described above), 2 ^1 of the secondary reaction mix was added to each sample: Mix 2A 
was added to samples #1-10; Mix 2B was added to samples #1 1-20; and Mix 2C was 

10 added to samples #21-30. The temperature was adjusted to 52'^C and the samples were 
incubated for 30 minutes at 52°C. The reactions were then stopped by ihe addition of 10 
]x\ of a solution of 95% formamide, 5 mM EDTA and 0.02% crystal violet. All samples 
were heated to 95°C for 2 minutes, and 4 jil of each sample were resolved by 
electrophoresis through 20% denaturing acrylamide gel (19:1 cross-linked) with 7 M 

15 urea, in a buffer containing 45 mM Tris-Borate (pH8.3) and I A mM EDTA. The results 
were imaged using the Molecular Dynamics Fluoroimager 595, excitation 488, emission 
530. The resulting images are shown in Fig. 34. 

In Fig. 34, Panel A shows the results of the target titration when no ARRESTOR 
oUgonucleotide was mcluded in the secondary reaction; Panel B shows the results of the 

20 same target titration using an ARRESTOR oligonucleotide that extended 2 nt into the 
non-target complementary region of the primary probe; and Panel C shows the results of 
the same target titration usmg an ARRESTOR oligonucleotide that extended 4 nt into the 
non-target complementary region of the primary probe. The product of the secondary 
cleavage reaction is seen as a band near the bottom of each panel The first two lanes of 

25 each panel (i.e., 1 and 2, 1 1 and 12, 21 and 22) lacked target DNA, and the signal that co- 
migrates with the product band represents the nonspecific background under each set of 
conditions. 

It can be seen by visual inspection of these panels that the background signal is 
both reduced, and made more predictable, by the inclusion of either species of 
. 30 ARRESTOR oligonucleotide. In addition to reducing the background in the no-target 
control lanes, the background reduction in the reactions that had the more dilute amounts 
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Of target included is reduced, leading to a signal that is a more accurate reflection ofthe 
target contained within the reaction, thus improving the quantitative range ofthe 
multiple, sequential invasive cleavage reaction. 

To quantify the impact of including the ARRESTOR oUgonucleotide in the 
secondary cleavage reaction under these conditions, the average product band signal from 
the reactions having the largest amount of target (U, averages ofthe signals from lanes 9 
and 10, lanes 19 and 20. and lanes 29 and 30), were compared to the averaged signal 
from the no-target contol lanes for each panel, determine the "fold over background," the 
factor of signal amplification over background, under each set of conditions. For the 
reactions without the ARBESTOR oligonucleotide. Panel A, the fold over background 
was 5.3; for Panel B, the fold over background was 12.7; and for Panel C, the fold over 
background was 13.4, indicating that in this system inclusion of any ARRESTOR 
oligonucleotide at least doubled the specificity ofthe signal over the ARRESTOR 
oligonucleotide -less reactions, and that the ARRESTOR oUgonucleotide that extended 
slightly farther into the non-target complementary region may be sUghtly more effective, 
at least in this embodiment ofthe system. This clearly shows the benefits of using an 
ARRESTOR oUgonucleotide to enhance the specificity of these reactions, an advantage 
that is of particular benefit at low levels of target nucleic acid. 

EXAMPLE 15 

"ARRESTOR" Oligonucleotides AUow use of Higher Concentrations of Primary 
Probe Without Increasing Background Signal 
Increasing the concentation ofthe probe in tiie invasive cleavage reaction can 
dramatically increase the amount of signal generated for a given amount of target DNA. 
While not intending to Umit the explanation to any specific mechanism, this is beUeved to 
be caused by the fact that increased concentoration of probe increases the rate at which the 
cleaved probe is suppUmted by an uncleaved copy, thereby increasing the apparent 
turnover rate ofthe cleavage reaction. Unfortunately, this effect could not heretofore be 
applied in the primary cleavage reaction of a multiple sequential INVADER assay 
because the residual uncleaved primary probe can hybridize to the secondary target, in 
competition with the cleaved molecules, thereby reducing the efficacy of the secondary 
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reaction. Elevated concentrations of primaiy probe exacerbate this problem. Further, the 
resulting complexes, as described above, can be cleaved at a low level, contributing to 
background. Therefore, increasing the primary probe can have the double negative effect 
of both slowing the secondary reaction and increasing the level of this form of non target- 

5 specific background The use of an ARRESTOR oligonucleotide to sequester or 
neutralize the residual primary probe allows this concentration-enhancing effect to be 
applied to these sequential reactions. 

To demonstrate this effect, two sets of reactions were conducted. In the first set 
of reactions, the reactions were conducted using a range of primary probe concentrations, 

10 but no ARRESTOR oligonucleotide was supplied in the secondary reaction. In the 
second set of reactions, the same probe concentrations were used, but an ARRESTOR 
oligonucleotide was added for the secondary reactions. 

All reactions were performed in duplicate. Primary INVADER reactions were 
done in a final volimae of 10 \il and contained: 10 mM MOPS, pH 7.5, 7.5 mM MgCh, 

15 500 fin of primary INVADER (218-55-05; SEQ ID NO:596); 30 ng of AfiiFENl enzyme 
and 10 ng of human genomic DNA. 100 zeptomoles of HBV pAM6 #2 amplicon was 
inchided in all even numbered reactions (by reference to Figs. 35A and B). Reactions 
included 10 pmoles, 20 pmoles, 50 pmoles, 100 pmoles or 150 pmoles of primary probe 
(218-55-02; SEQ ID NO:601), MOPS, target and INVADER oligonucleotides were 

20 combined to a final volume of 7 Samples were heat denatured at 95^C for 5 minutes, 
then cooled to 67*^0. During the 5 minute denaturation, MgCk, probe and enzyme were 
combined. The primary INVADER reactions were initiated by the addition of 3 jU of 
MgCla, probe and enzyme mix, to the final concentrations indicated above. Reactions 
were incubated for 30 minutes at 67**C. The reactions were then cooled to 52**C, and 

25 each primary INVADER reaction received the following secondary reaction components 
in a total volume of 4 ^il: 2.5 pmoles secondary target (oligo number 218-95-04; SEQ ID 
NO:602); 10 pmoles secondary probe (oligo number 228-48-04; SEQ ID NO: 598). The 
reactions that included the ARRESTOR oligonucleotide had either 40 pmoles, 80 pmoles, 
200 pmoles, 400 pmoles or 600 pmoles of ARRESTOR oligonucleotide (oligo number 

30 218-95-01; SEQ ID NO:600), added at a 4-fold molar excess over the primary probe 
amount for each reaction, with this mix. Reactions were then incubated at 52^C for 30 
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minutes: The reactions were stopped by the addition of 10 nl of a solution of 95% 
fonnamide, 10 mM EOTA and 0.02% crystal violet All samples were heated to 95°C 
for 1 minute, and 4 jil of each sample were resolved by electrophoresis through 20% 
denaturing aaylamide gel (19:1 cross-linked) wifli 7 M urea, in abuffer containing 45 
5 mM Tris-Borate (pH8.3) and 1.4 mM EDTA. The results were imaged using the 
Molecular Dynamics Fluoroimager 595, excitation 488, emission 530. The resulting 
nnages for the reactions either without or with an ARKESTOR oUgonucleotide are shown 
in Figs. 35A and 35B, respectively. The products of cleavage of the secondary probe are 
seal as a band near the bottom of each panel. 
10 In Fig. 35A, lane sets 1 and 2 show results with 10 pmoles of primary probe; 3 

and 4 had 20 pmoles; 5 and 6 had 50 pmoles; 7 and 8 had 100 pmoles; and 9 and 10 had 
150 pmoles. It can be seen by visual examination, that the increases in the amount of 
primary ptobe have the combined effect of slightly increasing the background in the no- 
target lanes (odd numbers) whUe reducing the specific signal in the presence of target 
15 (even numbered lanes), and therefore the reducing the specificity of the reaction if 
viewed as the measure of "fold over background," demonstrating that the approach of 
increasing signal by increasing probe cannot be appUed in these sequential reactions. 

In Fig. 35B, lane sets 1 and 2 show results with 10 pmoles of primary probe; 
while 3 and 4 had 20 pmoles; 5 and 6 had 50 pmoles; 7 and 8 had 100 pmoles; and 9 and 
20 10 had 150 pmoles. In addition, each reaction included 4-fold molar excess of the 
ARRESTOR oUgonucleotide added before the secondary cleavage reaction. It can be 
seen by visual examination that the background in the no-target lanes (odd numbers) is 
lower in all cases, while the specific signal in th^ presence of target (even numbered 
lanes) increases with increased amounts of primary probe, leading to a greater "fold over 
25 background" sensitivity at this target level. 

To quantitatively compare these effects, the fluorescence signal fix)m the products 
of both non-specific and specific cleavage were measured. The results are depicted 
graphically in Fig. 35C, graphed as a measure of the percentage of the secondary probe 
cleaved during the reaction, compared to the amount of primary probe used. 
30 Examination of the plots from the no-target reactions confirms that the background in the 
absence of the ARRESTOR oligonucleotide is, in general, roughly two-fold higher, and 
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that both increase slightly with the increasing probe amounts. The specific signals 
however, diverge between the two sets of reaction more dramatically. While the signal in 
the no-ARRESTOR oligonucleotide reactions decreases steadily as primary probe was 
increased, the signal in the ARRESTOR oligonucleotide reactions continued to increase. 
5 At the highest primary probe concentrations tested, the no-ARRESTOR oligonucleotide 
reactions had specific signal that was only 1 .7 fold over background, while the 
ARRESTOR oligonucleotide reactions detected the 100 zmoles (60,000 copies) of target 
with a signal 6.5 fold over background, thus demonstrating the improvmient in the 
sequential invasive cleavage reaction when an ARRESTOR oligonucleotide is included. 

10 

EXAMPLE 16 

Modified Baclcbones Improve Performance of ARRESTOR Oligonucleotides All 
Natural "ARRESTOR" OUgo With No 3'-Amine 

The reactions described in the previous two Examples used ARRESTOR 
15 oligonucleotides that v/&re constructed using T 0-methyl ribose backbone, and that 
included a positively charged amine group on the 3' temunal nucleotide. The 
modifications were made specifically to reduce enzyme interaction with the primary 
probe/ARRESTOR oligonucleotide complex. During the development of the present 
mvention, it was determined that the 2* 0-methy modified oligonucleotides are somewhat 
20 resistant to cleavage by the S' nucleases, just as they are slowly degraded by nucleases 
when used in antisense applications {See e,g., Kawasaki et al, J. Med. Chem., 36:831 
[1993]). 

The presence of an amino group on the 3* end of an oligonucleotide reduces its 
ability to direct invasive cleavage. To reduce the possibility that the ARRESTOR 
25 oligonucleotide would form a cleavage structure in this way, an amino group was 
included in the design of the experiments described in this and other Examples. 

Initial designs of the ARRESTOR oligonucleotides (sometimes referred to as 
"blockers") did not mclude these modifications, and these molecules were found to 
provide no benefit in reducing background cleavage in the sequential invasive cleavage 
30 assay and, in fact, sometimes contributed to background by inducing cleavage at an 
' unanticipated site, presumably by providing some element to an alternative cleavage 



256 



wo 01/90337 



PCT/USOl/17086 



Structure. The effects of natural and modified ARRESTOR oligonucleotides on the 
background noise in fliese reactions are examined in this Example. 

The efiBcacy of an "ail-natural ARRESTOR oHgonucleotide (i.e.. an ARRESTOR 
oligonucleotide that did not contain any base analogs or modifications) was examined by 
comparison to an identical reactions that lacked ARRESTOR oligonucleotide. All 
reactions were performed in duplicate, and were conducted as follows. Two master 
mixes were assembled, each containing 12.5 mM MOPS, pH 7.5, 500 finoles primary 
INVADER oligonucleotide #218-55-05 (SEQ ID NO:596). 10 ng human genomic DNA 
(Novagen) and 30 ng AfiiFENl enzyme for every 8 jil of mix. Mix A contained no 
added HBV genomic ampUcon DNA, mix B contained 600,000 molecules of HBV 
genomic ampUcon DNA, pAM6 #2. The mixes were distributed to the reaction tubes, in 
aUquots of 8 yAftabe as foUows: mix A to tubes 1, 2, 5 and 6; and mix B to tubes 3, 4, 7 
and 8. The samples were incubated at 95''C for 4 minutes to denature the HBV genomic 
amplicon DNA. The reactions were then cooled to 67»C and 2ul of a mix containing 37.5 
mM MgCl2 and 10 pmoles 218-55-02B (SEQ ID NO:603) for every 2 ^a, was added to 
each sample. The samples were then incubated at 67''C for 30 minutes. Two secondary 
reaction master mixes were prepared, each containing 10 pmoles of secondary probe 
oUgo #228-48-04N (SEQ ID NO:604) and 2.5 pmoles of secondary target oligonucleotide 
#218-95-04 (SEQ JD NO:602) for every 3 \il of mix. Mix 2A contained no additional 
oligonucleotide, while mix 2B contained 50 pmoles of the natural "ARRESTOR" 
oUgonucleotide #241-62-02 (SEQ ID NO:605). After the initial 30 minute incubation at 
67°C, the temperature was adjusted to 52''C, and 3 nl of a secondary reaction mix was 
added to each sample, as follows: Mix 2A was added to samples #1-4; and Mix 2B was 
added to samples #5-8. The samples were then incubated for 30 minutes at 52''C. The 
reactions were then stopped by the addition of 10 ^1 of a solution of 95% formamide, 10 
mM EDTA and 0.02% crystal violet. 

All of the samples were heated to 95°C for 2 mmutes, and 4 \i\ of each sample 
were resolved by electrophoresis through a 20% denaturing acrylamide gel (19:1 cross- 
linked) with 7 M urea, in a buffer containing 45 mM Tris-Borate (pH8.3) and 1 .4 mM 
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EDTA, The results were imaged using the Molecular Dynamics FluoroimagCT 595, 
excitation 488, emission 530, The resulting image is shown in Fig. 36A. 

To compare the effects of the various modifications made to the ARRESTOR 
oligonucleotides, reactions were performed using ARRESTOR oligonucleotides having 

5 all natural bases, but including a 3* terminal amine; ARRESTOR oligonucleotides having 
a 3' portion composed of 2* 0-methyl nucleotides, plus the 3* terminal amine; and 
ARRESTOR oligonucleotides composed entirely of 2' 0-methyl nucleotides, plus the 3* 
tominal amine. These w^e compared to reactions performed without an ARRESTOR 
oligonucleotide. The reactions were conducted as follows. Two master mixes were 

10 assembled, all mixes contained 14.3 mM MOPS, pH 7.5, 500 finoles primary INVADER 
oligo #21 8-55-05 (SEQ ID NO:596) and 10 ng human genomic DNA (Novagen) for 
every 7 ^1 of mix. Mix A contained no added HBV genomic amplicon DNA, mix B 
contained 600,000 molecules of HBV genomic amplicon DNA, pAM6 #2. The mixes 
were distributed to the reaction tubes, at 7 |al/tube: mix A to tubes 1, 2, 5, 6, 9, 10, 13 and 

15 14; and mix B to tubes 3, 4, 7, 8, 11, 12, 15 and 16. The samples were wanned to 95*^0 
for 4 minutes to denature the HBV DNA. The reactions were then cooled to 67**C and 3 
\xl of a mix containing 25 mM MgCh, 25 pmoles 218-55-02B (SEQ ID NO:603) and 30 
ng AfuFENl enzyme per 3 were added to each sample. The samples were then 
incubated at 67°C for 30 minutes. Four secondary reaction master mixes were prepared; 

20 all mixes contained 10 pmoles of secondary probe oligonucleotide #228-48-043 (SEQ ID 
NO:606) and 2.5 pmoles of secondary target oligonucleotide #218-95-04 (SEQ ID 
NO:602) for every 3 ^1 of mix. Mix 2A contained no additional oligonucleotide, while 
mix 2B contained 100 pmoles of the natural+amine ARRESTOR oligonucleotide # 241- 
62-01 (SEQ ID NO:607), mix 2C contained 100 pmoles of partially O-methyl+amine 

25 oligonucleotide # 241-62-03 (SEQ ID NO:608) and mix 2D contained 100 pmoles of all 
O-methyl+amine oligonucleotide #241-64-01 (SEQ ID NO:609). After the mitial 30 
minute incubation at 67°C, the temperature was adjusted to 52°C and 3 jil of a secondary 
reaction mix was added to each sample, as follows: mix 2A was added to samples #1-4; 
mix 2B was added to samples #5-8; mix 2C was added to samples #9-12; and mix 2D 

30 was added to samples #13-16. The samples were incubated for 30 minutes at 52°C, then 
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Stopped by the addition of 10 jil of a solution of 95% formamide. 10 mM NaEDTA, and 

0.2% crystal violet. 

AU samples were heated to 95'C for2 minutes, and4 fil of each sample were 
resolved by electrophoresis through a 20% denaturing acrylamide gel (19:1 cross-linked) 
with 7 M urea, in a buffer containing 45 mM Tris-Borate (pH8.3) and 1.4 mM EDTA. 
The results were imaged using the Molecular Dynamics Fluoroimager 595, excitation 
488, emission 530. The resulting unage is shown in Fig. 36B. 

hi Fig. 36A, the left-hand panel shows the reactions fliat lacked an ARRESTOR 
oUgonucleotide, while the right hand panel shows the data fiom reactions that included 
the all natural ARRESTOR oUgonucleotide. The first two lanes of each panel are from 
no-target controls, the second set of lanes contained target. The products of cleavage are 
visible in the bottom one/fourth of each panel. The position at which Ihe specific reaction 
products should run is indicated by anows on left and right 

It can be seen by exammation of these data, that the reactions run m the absence 
of ARRESTOR oligonucleotide show reproducible quaUty between the replicates, and 
show significant cleavage only when target is present In contrast, the addition of another 
xaimodified oUgonucleotide into the reactions causes great variation between the replicate 
lanes {e.g.. lanes 5 and 6 were provided with the same reactants, but produced markedly 
different results). The introduction of the aU-natural ARRESTOR oUgonucleotide 
produced, rather than reduced, background in these no-ta^et lanes, and increased 
cleavage at other sites the bands other that those mdicated by the arrows flanking the 
panels). For these reasons the modifications that are described above, the effects of 
which are shown on Fig. 36B, were incorporated. 

The first 4 lanes of Fig. 36B show Ihe products of dupUcate reactions without an 
ARRESTOR oUgonucleotide. plus or minus the HBV target (lanes 1, 2, and lanes 3, 4, 
respectively); The next 4 lanes, 5, 6 and 7, 8 used a natural ARRESTOR oligonucleotide 
having a 3' terminal amine; lanes 9, 10 and 11, 12 used the ARRESTOR oUgonucleotide 
with a 3* portion composed of 2' 0-me(hyl nucleotides, and having a 3' terminal amine; 
lanes 13, 14 and 15, 16 used Ihe ARRESTOR oUgonucleotide composed entirely of 2' O- 
methyl nucleotides and having a 3' terminal amine. The products of cleavage of the 
secondary probe are visible in the lower one third of each panel. 
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Visual inspection of these data shows that the addition of the 3* terminal amine to 
the natural ARRESTOR oligonucleotide suppresses the aberrant cleavage seen in Fig. 
36A, but this ARRESTOR oligonucleotide does not improve the performance of the 
leaction, as compared to the no- ARRESTOR oligonucleotide controls. In contrast, the 

5 use of the T 0-methyl nucleotides in the body of the ARRESTOR oligonucleotide does 
reduce background, whether partially or completely substituted. To quantify the relative 
effects of these modifications, the fluorescence fix)m each of the co-migrating product 
bands was measured, the signals firom the duplicate lanes were averaged and the "fold 
over background" was calculated for each reaction contaming target nucleic acid. 

10 When ARRESTOR oligonucleotide was omitted, the target-specific signal (lanes 

3, 4) was 27-fold over the no target background; the natural ARRESTOR oligonucleotide 
+amine gave a signal of 17-fold over background; tiie partial T 0-methyl+ amine gave a 
signal of 47-fold over background; and the completely 2' 0-methyl+ amine gave a signal 
of 33 fold over background. 

15 These Figures show that both modifications can have a beneficial effect on the 

specificity of the multiple, sequential invasive cleavage assay. They also show that flie 
use of the 2* 0-methyl substituted backbone, eiflier partial or entire, markedly unproves 
the specificity of these reactions. It is intended that, in various embodiments of the 
present mventon, any number of modifications that make either the ARRESTOR 

20 oUgonucleotide or the complex it forms with the primary target resistant to nucleases will 
provide similar enhancement 

EXAMPLE 17 

Effect of ARRESTOR Oligonucleotide Length on Signal Enhancement in Multiple 

25 Sequential Invasive Cleavage Assays 

As noted in the Description of the Invention, the optimal length for an 
ARRESTOR oligonucleotide depends upon the design of the other nucleic acid elements 
of the INVADER reaction, particularly on the design of tiie primary probe. In this 
Example,'the effects of varying ^e length of the ARRESTOR oligonucleotide were 

30 explored in systems using two different secondary probes. A schematic diagram showing 
these ARRESTOR oligonucleotides aligned as they would hybridize to the primary probe 
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oUgonucleotide is provided in Fig. 37C. In this Figure, the region of the primary probe 
that recognizes the target nucleic acid is shown underlined; the non-underlined portion, 
plus the first underlined base is the portion that is released by the first cleavage, and goes 
on to participate in flie second or subsequaxt cleavage structure. 

All reactions were performed in dupUcate. The INVADER reactions were done 
in a final volume of 10 jil final volume containing 10 mM MOPS. pH 7.5, mM MgCU. 
500 finoles of primary INVADER 241-95-01, (SEQ ID NO:610). 25 pmoles of primary 
probe 241-95-02 (SEQ ID N0:6U), 30 ng of AfuFENl enzyme, and 10 ng of human 
genomic DNA, and if included, 1 amoles of HBV ampUcon pAM 6 #2. MOPS, target 
DNA, and INVADER oligonucleotides were combmed to a final volume of 7 \il 
Samples were heat denatured at 95»C for 5 minutes, then cooled to 67°C. During the 5 
minute denaturation, MgCh. probe and enzyme were combined. The primary INVADER 
reactions were initiated by the addition of 3 yii of MgClj, probe and enzyme mix, to the 
final concentrations indicated above. Reactions were incubated for 30 minutes at 6rC. 
The reaction were then cooled to 52»C, and each primary INVADER reaction received 
the following secondary reaction components in a total vohraie of 3 fil: 2.5 pmoles 
secondary target 241-95-07 (SEQIDNO:612), 10 pmoles ofeither secondary probe 228- 
48-04 (SEQ ID NO:598), or 228-48-04N (SEQ ID NO:604) and 100 pmoles of an 
ARRESTOR oligonucleotide, either 241-95-03 (SEQ ID NO;613), 241-95-04 (SEQ ID 
NO:614), 241-95-05 (SEQ ID NO:615) or 241-95-06 (SEQ IDNO:616). The 
ARRESTOR oligonucleotides were omitted from some reactions as contiwls for 
ARRESTOR oligonucleotide effects. 

The reactions were incubated at 52°C for 34 minutes, and were flien stopped by 
file addition of 10 ^1 of 95% foimamide, 10 mM EDTA, and 0.02% crystal violet All 
samples were heated to 95°C for 1 minute, and 4 ^il of each sample were resolved by 
electrophoresis through 20% denaturing acrylamide gel (19:1 cross-linked) with 7 M 
urea, in abuffer containing 45 mM Tris-Borate (pH8.3) and 1.4 mM EDTA. The results 
were imaged using flie Molecular Dynamics Fluoroimager 595, excitation 488, emission 
530. The resulting images for the reactions with the shorter and longer secondary probes 
are shown in Figs. 37A and 37B. respectively. 
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In each Figure, the products of cleavage are visible as bands in the bottom half of 
each lane. The first 4 lanes of each Figure show the products of duplicate reactions 
without an ARRESTOR oligonucleotide, plus or minus the HBV target (lanes sets 1 and 
2 respectively); in the next 4 lanes, sets 3 and 4 used the shortest ARRESTOR 241-95-03 

5 (SEQ ID NO:613); lanes 5 and 6 used 241-95-04 (SEQ ID NO:614); lanes 7 and 8 used 
241-95-05 (SEQ ID NO:615); and lanes 9 and 10 used 241-95-06 (SEQ ID NO:616). 

The principal background of concern is the band that appears in the "no target" 
control lanes (odd numbers; this band co-migrates with the target-specific signal near tiie 
bottom of each gel panel). Visual inspection shows that the shortest ARRESTOR 

10 oligonucleotide was the least effective at suppressing this background, and that the 
efiBcacy was inoreased when the ARRESTOR oligonucleotide extended fiu^her into the 
portion that participates in the subsequent cleavage reaction. Even with this difference in 
effect, it can be seen firom these data that there is much latitude in the design of the 
ARRESTOR oligonucleotide. The choice of lengths will be influenced by the 

15 temp^ature at which the reaction making use of the ARRESTOR oligonucleotide is 
performed, the lengths of the duplexes fomied between the primary probe and the target, 
the primary probe and the secondary target, and the relative concentrations of the 
different nucleic acid species in the reactions. 



20 EXAMPLE 18 

Effect of ARRESTOR Oligonucleotide Concentration on Signal Enhancement in 
Multiple Sequential Invasive Cleavage Assays 
In examining the effects of including ARRESTOR oligonucleotides in these 
cleavage reactions, it was of interest to determine if the concentration of the ARRESTOR 
25 oligonucleotide in excess of the primary probe concentration would have an effect on 
yields of either non-specific or specific signal, and if the length of the ARRESTOR 
oligonucleotide would be a factor. These two variables were investigated in the 
following Example. 

AH reactions were performed in duplicate. The primary INVADER reactions 
30 were done in a final volume of 10 pi and contamed 10 mM MOPS, pH 7.5; 7.5 mM 
MgCl2, 500 finoles of primary INVADER 241-95-01 (SEQ ID NO:610), 25 pmoles of 
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primaiy probe 241-95-02 (SEQ ID N0:61 1), 30 ng of AfoFENl enzyme, and 10 ng of 
humangenomicDNA. Where included, the target DNA was 1 amoleof HBV ampUcon 
pAM 6 #2, as described above. MOPS, target and INVADER were combined to a final 
volume of 7 nl. The samples were heat denatured at 95°C for 5 minutes, then cooled to 
ere. During the 5 minute denaturation,MgCl2. probe and enzyme were combined. The 

primaiy INVADER reactions were initiated by the addition of 3 ^il of MgCla, probe and 
enzymemix. The reactions were incubated for 30 nunutes at 67°C. The reactions were 
tiien cooled to Sl'^C and each primary INVADER reaction received the foUowing 
secondary reaction components: 2.5 pmoles secondary target 241-95-07 (SEQ ID 
NO:612), 10 pmoles secondary probe 228-48-04 (SEQ ID NO:598); and. if inchuied. 50. 
100 or 200 pmoles of eiflier ARRESTOR oUgonucleotide 241-95-03 (SEQ ID NO:613) 
or 241-95-05 (SEQ ID NO:615), in a total volume of 3 jil. Reactions were flien 
incubated at 52»C for 35 minutes. Reactions were stopped by tixe addition of 10 \i\ of 
95% formamide, 10 mM EDTA, and 0.02% crystal violet. All of the samples were 
heated to 95''C for 1 minute, and 4 jil of each sample were resolved by electrophoresis 
through 20% denaturing acrylamide gel (19:1 cross-linked) witii 7 M urea, in a buffe 
containing 45 mM Tris-Borate (pH8.3), and 1 .4 mM EDTA. The results were imaged 
using the Molecular Dynamics Ruoroimager 595, excitation 488. emission 530. The 
resulting images are shown as a composite image in Fig. 38. 

Each of the duplicate reactions were loaded on tiie gel in adjacent lanes and are 
labeled with a single lane number. All odd numbered lanes were no-target controls. 
Lanes 1 and 2 had no ARRESTOR oUgonucleotide added; lanes 3-8 show results from 
reactions containing the shorter ARRESTOR oligonucleotide, 241-95-03 (SEQ ID 
NO:613); lanes 9-14 show results from reactions contaming tiie longer ARRESTOR 
oligonucleotide, 241-95-05 (SEQ ID NO:615). The products of cleavage from the 
secondary reaction are visible in the bottom one fliird of each panel. Visual inspection of 
tiiese data {i.e.. comparison of tiie specific products to the background bands) shows that 
both ARRESTOR oligonucleotides have some beneficial effect at all concentration. 

To quantify the relative effects of ARRESTOR oligonucleotide lengtii and 
concentration, tiie fluorescence from each of tiie co-migrating product bands was 
measured, tiie signals from the dupHcate lanes were averaged and ttie "fold over 
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background" (signal+target/signal-target) was calculated for each reaction containing 
target nucleic acid The reaction lacking an AKRESTOR oligonucleotide yielded a signal 
approximately 27-fold over background. Inclusion of the shorter ARRESTOR 
oligonucleotide at SO, 100 or 200 pmoles produced products at 42, 51 and 60-fold over 
5 background, respectively. This shows that while the short ARRESTOR at the lowest 
concentration seems to be less effective than the longer ARRESTOR oligonucleotides 
(See, e.g.y previous Example) this can be compensated for by increasing the concentration 
of ARRESTOR oligonucleotide, and fho-eby the ARRESTOR oligonucleotide:primary 
probe ratio. 

10 In contrast, inclusion of the longer ARRESTOR oligonucleotide at 50, 100 or 200 

pmoles produced products at 60, 32 and 24 fold over background, respectively. At the 
lowest concentration, the efficacy of this longer ARRESTOR oligonucleotide relative to 
the shorter ARRESTOR oligonucleotide is consistent with the previous Example. 
Increasing the concentration, however, decreased the yield of specific product, suggesting 

15 a competition effect with some element of the secondary cleavage reaction. 

These data show that the ARRESTOR oligonucleotides can be used to advantage 
in a number of specific reaction designs. The choice of concentration will be influenced 
by the temperature at which the reaction making use of the ARRESTOR oligonucleotide 
is performed, the lengths of the duplexes formed between the primary probe and the 

20 target, the primary probe and the secondary target, and between the primary probe and 
the ARRESTOR oUgonucleotide. 

Selection of oligonucleotides for target nucleic acids other than the HBV shown 
here, (e,g., oligonucleotide composition and length), and tiie optimization of cleavage 
reaction conditions in accord with the models provided here follow routine methods and 

25 common practice well known to those skilled in the methods of molecular biology. 

Example 10 demonstrated that some enzymes require an overlap between an 
upstream INVADER oligonucleotide and a downstream probe oligonucleotide to create a 
cleavage structure O^igure 27). It has also been determmed that the 3' termmal nucleotide 
of the INVADER oligonucleotide need not be complementary to the target strand, even if 

30 it is the only overlapping base in the INVADER oUgonucleotide (e.g., as with the HCMV 
probes shown in Figure 30). The requirement for an overlap can serve as a convenient 
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basis for detecting single base polymorphisms (SNPs) or mutations in a nucleic acid 



10 



15 



20 



25- 



For detection of single base variations, at least two oUgonucleotides {e.g.. 
a probe and an INVADER oUgonucleotide) hybridize in tandem to the target nucleic acid 
to form the overlapping structure recognized by the CLEAVASE enzyme to be used in 
thereaction. An unpaired "flap" is included on the 5' end of the probe. Theenzyme 
recognizes the overlap and cleaves ofifthe unpaired flap, releasing it as a target-specific 
product Enzymes that have a strong preference for an overlapping structure, i.e., that 
cleave the overlapping structure at a much greater rate than they cleave a non- 
overlapping structure include Ihe FBN-1 enzymes from Archaeoglobmfulgidus and 
Pyrococcusfuriosus and such enzymes are particularly preferred in the detection of 
mutations and SNPs. . 

EXAMPLE 19 
Kits for performing the mRNA INVADER Assay 
In some embodunents, the present invention provides kits comprising one or more 
of Ihe components necessaiy for practicing the present invention. For example, the 
present invention provides kits for storing or deUvering the enzymes of the present 
invention and/or tiie reaction components necessary to practice a cleavage assay (e.g.. the 
INVADER assay). By way of example, and not intending to limit flie kits of the present 
invention to any particuha- configuration or combination of components, the foUowmg 
section describes one embodiment of a kit for practicing tiie present invention: 

In some embodunents, tiie kits of tiie present invention provide tiie following reagents: 

CLEAVASE enzyme (e.g.. Primary Oligos 
TtiiAKK) ^ 

RNA Primary Buffer 1 Secondary Oligos 

KNA Secondary Buffer 1 RNA Standard [lOOamol/jil] 

tRNA CaiTier [20 ng/^l] lOX CeU Lysis Buffer 1 
Tioeo.i Buffer [lOmMTris'HCl, 
pH8, 0.lmMEDTA] 
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Examples of Primary Oligonucleotides and Secondary Oligonucleotides suitable 
for use with the methods of the present invention are provided in Figure 41 . While the 
oUgonucleotides shown therein may find use in a number of the methods, and variations 
of the methods, of the present invention, these INVADER assay oligonucleotide sets find 

5 particular use with kits of flie present invention. The oUgonucleotide sets shown in 
Figure 41 may be used as individual sets to detect individual target RNAs, or may be 
comined in biplex or multiplex reactions for the detection of two or more analytes or 
controls in a single reaction. It is contemplated that the designs of these probes sets (e.g., 
the oligonucleotides and/or their sequences) may be adapted for use in DNA detection 

10 assays, using the guidelines for reaction design and optimization provided herein. 
Additional oligonucleotides Qiat find use m detection assays and kits of the present 
invention are provided in Figure 47. 

In some embodiments, a kit of the present inv^tion provides a list of additional 
components (e,g.^ reagents, supplies, and/or equipment) to be supplied by a user in order 

IS to perfonn the methods of the invention. For example, and without intending to limit 
such additional components lists to any particular components, one embodiment of such a 
list comprises the following: 

■ RNase-firee (e.g., DEPC-treated) H2O 

■ Clear CHILLOUT-14 liquid wax (MJ Research) or RNase-fi-ee, optical grade mineral 
20 oil (Sigma, Cat No. M-5904) 

■ Phosphate-bufifered saline (no MgCU, no CaCla) 

■ 96-well polypropylene microplate (MJ Research, Cat No. MSP-9601) 

■ 0.2-ml thin-wall tubes 

■ Thermaseal well t^e (e.g., GeneMate, Cat No. T-2417-5) 
25 ■ Multichannel pipets (0.5-lOfil, 2.5-20|il, 20-200til) 

" Thermal cycler or other heat source (eg., lab oven or heating block). 

■ Fluorescence microplate reader (a preferred plate reader is top-reading, equipped with 
light filters have the following characteristics: 
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Excitation Emission 

(Wavelength/Bandwidth) (Wavelength/Bandwidth) 
485nm/20nm 530nni/25nm 
560nni/20nm 620nm/40nm 

In some embodiments, a kit of the present invention provides a list of optional 
components (e.g., reagents, suppUes, and/or equipment) to be suppUed by a user to 
faciUtate performance of the methods of the invention. For example, and without 
5 intending to limit such optional components Usts to any particular components, one 
embodiment of such a list comprises ttie following: 
■ tRNA Solution, 20ng/nl (Sigma, R-5636) 

- IX Stop Solution (10mMTris»HCl,pH 8, lOmMEDTA) 

- Black opaque, 96-well microplate {e.g., COSTAR, Cat. No. 3915) 
10 ■ Electronic repeat pipet (250^1) 

In some embodiments of a kit, detailed protocols are provided. In preferred 
embodiments, protocols for the assembly of INVADER assay reactions (e.g., 
formulations and preferred procedures for making reaction mixtures) are provided. In 

15 particularly preferred embodiments, protocols for assembly of reaction mixtures include 
computational or graphical aids to reduce risk of error in the perfonnance of the methods 
of the present invention (e.g., tables to facilitate calculation of volumes of reagents 
needed for multiple reactions, and plate-layout guides to assist in configuring multi-well 
assay plates to contain numerous assay reactions). By way of example, and without 

20 intending to Ifanit such protocols to any particular content or format, kits of the present 
invention may comprise the following protocol: 

I. DETAILED mRNA INVADER ASSAY PROTOCOL 
1. Plan the microplate layout for each experimental run. An example microplate layout 
25 for 40 samples, 6 standards, and a No Target Control is shown in Fig. 40. delusion of 
a No Target Control (tRNA Carrier or IX CeU Lysis Buffer 1) and quantitation 
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Standards are required for absolute quantitation. 
2. Prepare the Primary Reaction Mix for either the single or biplex assay format To 
calculate the volumes of reaction components needed for the assay (X Volume), 
multiply the numba: of reactions (for both samples and controls) by 1 .25 \X Volume 
5 = # reactions x 1.251. Vortex the Primary Reaction Mix briefly after the last 

reagent addition to mix thoroughly. Aliquot 5^1 of the Primary Reaction Mix per 
microplate well (an electronic repeat pipet is recommended for this stq>). 

Primary Reaction Mix 
10 Single Assay Format 



Reaction Components 


IX Volume 


X Volume 


RNA Primary Buffer 1 


4.0nl 




Primary Oligos 


0.25^1 




Tioeo.i Buffer 


0.25nl 




CLBAVASE eoz. enzyme 


0.5til 




Total Mix Volume (IX) 


5.0m1 




Biplex Assay Format 


Reaction Components 


IX Volume 


X Volume 


RNA Primary Buffer 1 


4.0^ 




Primary Oligos 


0.25nl 




Housekeeping Primary Oligos 


0.25til 




CLBAVASE enzyme 


0.5^1 




Total Mix Volume (IX) 


S.0m1 





15 3. Add 5|il of each No Target Control, standard, or sample (total KNA or cell lysate) to 
the appropriate well and mix by pipetting up and down 1-2 times. Overlay each 
reaction with 10|al of clear CHELLOUT or mineral oil. Seal microplate with 
Thermaseal well tape. 
4. Incubate reactions for 90 minutes at 60^C in a thermal cycler or oven. 

20 5. While the primary reaction is incubating, prepare the Secondary FRET Reaction Mix 
for the single or biplex format. Calculate the component volumes required QC 
Volume) by multiplying the number of reactions (for both samples and controls) by 
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1 .25 [X Volume (jd) = # reactions x L2S Aliquot the Secondary FRET 
Reaction Mix into multiple 0.2.ml thin-wall tubes or an 8-well strip (70jil/tube i 
suflBcient for a tow of 12 reactions). 

Secondary FRET Reaction Mix 



Single Assay Format 



Reaction Components 


IX Volume 


X Volume 


UNA Secondary Buffer 1 


2.0nl 




Secondary Oligos 


1.5ul 




Tioeo.1 Buffer ^ 


1.5nl 




Total Mix Volume (IX) 


5.0m1 




Biplex Assay Format 




Reaction Components 


IX Volume 


X Volume 


RNA Secondary Buffer 1 


2.0nl 




Secondary Oligos 


1.5nl 




Housekeeping Secondary 


1.5nl 




Total Mix Volume (IX) 


5.0m1 





6. After the primary reaction incubation is completed, remove the microplate seal, and 
add 5nl Secondary FRET Reaction Mix per weU using a multichannel pipet Mix by 
pipetting up and down 1-2 times. Reseal the microplate wifli tiie well tape and 
incubate the microplate at 60°C for 60 or 90 minutes, as indicated in each Product 
Information Sheet The secondary reaction incubation time can be varied. See 
sections 2 of the PROCEDURAL NOTES FOR OPERATION OF THE mRNA 
INVADER ASSAY for details. 

7. Reactions can be read using one of two procedures: Dir^tRead or Stop and 
Transfer . 

NCWE: Remove the microplate seal before reading the microplate. 
nirpr.t Read Procedure 
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■ This procedure enables collection of multiple data sets to extend the assay's 
dynamic range. During fee secondary INVADER reaction, read the micro^^^^ 
directly in a top-reading fluorescence microplate reader. 

■ Recommended settings for a PCTSq}tive Biosystem Cytofluor 4000 instrument are as 
follows: 



Specific Gene Signal: 


Housekeeping Gene Signal: 


Excitation: 


485/20nm 


Excitation: 


560/20nm 


Emission: 


530/25nm 


Emission: 


620/4Qnm 


ReadsAVeU: 


10 


Reads/Well: 


10 


Gain: 


40 


Gain: 


45 


Temperature: 


25^C 


Teft^jerature: 





NOTE: Because the optunal gain setting can vary between instruments, adjust 
the gain as needed to give the best signaI^ackground ratio (sample raw signal 
divided by the No Target Control signal) or No Target Control sample readings of 
-lOORFUs. Fluorescence microplate readers that use a xenon lamp source 
generally produce higher RFUs, For directly reading the microplates, the probe 
height of, and how the plate is positioned in, the fluorescence microplate reader 
may need to be adjusted according to the manufacture's recommendations. 

Stop and Transfer Procedure 

1. Prepare IX Stop Solution (lOmM Tris*HCl, pH 8, lOmM EDTA) with 
RNase-firee H2O* Add lOOp.1 per well with a multichannel pipet. 

2. Transfer lOO^il of the diluted reactions to a black microplate (e.g., 
COSTAR (Coming), Cat. No. 3915). 

3. Read the microplate using the same parameters as the Direct Read 
Procedure, but adjust the gain to give No Target Control sample readings of -'1 00 
RFUs (see NOTE above). 

In some embodiments, supplemenatary documentation, such as protocols for 
ancillary procedures, e.g., for the preparation of additional reagents, or for preparation of 
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samples for use in the methods of the present invention, are provided. In preferred 
embodiments, supplementary documentation includes guidelines and lists of precautions 
provided to fecilitate successfiil use of the methods and kits by unskilled or inexperienced 
users. In particularly preferred embodiments, supplementary documentation includes a 

5 troubleshooting guide, e.g., a guide describing possible problems that may be 

encountered by users, and providing suggested solutions or corrections to intended to aid 
the user m resolving or avoiding such problems. 

For example, and without intending to limit such supplementary documentation to 
any particular content, kits of the present mvention may comprise any of the foUowing 

10 procedures and guidelines: 

n. AVOroANCE OF RNase CONTAMINATION 

To avoid RNase contamination during sample prq)aration and testing, in one 
anbodiment, the user is cautioned to observe the following precautions: 
15 • Wear disposable gloves at all times to avoid contact with samples and reagents. 

• Use certified RNase-free disposables, including thin-waU polypropylene tubes and 
aerosol-barrier pipet tips, for preparing samples and assay reagents, to avoid cross- 
contamination. 

• Use RNase-free pEPC-treated) H2O for diluting samples and/or reagents. 
20 • Keq) RNA samples and controls on ice during assay setup. 

ffl. SAMPLE AND CONTROL PREPARATION 

NOTE: Dilute both standards and samples to concentrations that correspond to a 
5-(il addition pa: reaction. 
25 Example 1: The concentration of a 5-attomole standard is lamol/^l. lamol = 10" 

mole = 602,000 molecules. 
Example 2: The concentration of a 100-ng sample should be aOng/jil. 
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A. Control Preparation 

No Target Control : 

Total RNA Format: tRNA Carrier (2Qng/^l) 

CeU Lysate Format: IX Cell Lysis Buffer 1 (dilute lOX Cell Lysis Buffer 1 to IX 
5 withRNase-freeH20) 

Positive Control : RNA Standard (Std) (lOOamol/jil in vitro transcript) 

1 , Prepare RNA standards by diluting the positive controls with tRNA Carrier (when 
running total RNA samples) or with IX Cell Lysis Buffer 1 [lOX Cell Lysis Buffer 

10 1 diluted with RNase-free H2O] (when running cell lysate samples). The Product 

Information Sheet included in each kit indicates the recommended standard test 
levels and preparation methods. 

2. Using a fresh set of standards for each run is recommended. Store the standards on 
ice during reaction setup. 

15 

B. Total RNA Sample Preparation 

1 . Prepare total RNA from cells or tissue according to manufacturer's instructions for 
the selected preparation method. Recommended methods include TRIZOL (Life 
Technologies, Rockville, MD), RNEASY (Qiagen, Valencia, CA), and RNA WIZ 

20 (Ambion, Austin, TX). 

2. Dilute total RNA samples with RNase-free H2O to the appropriate concentration. 

C. Cell Lysate Sample Preparation - 96-weU microplate format 
NOTE: This cell lysate detection format is used for adherent cells cultured in 96-well 
25 tissue cxilture microplates. Cells are typically seeded at 10,000-40,000 cells per well 
Different seeding densities may be required depending on cell type and/or mRNA 
expression levels. See Procedural Notes for more details. For cells exhibiting high 
expression, die following methods can be used to attenuate the signal from the cell 
lysates: 



272 



wo 01/90337 



PCTAJSOl/17086 



■ plate fewer cells per well; 

■ dUute the cell lysates with IX CeU Lysis Buffer 1 before addition to 
reaction 2.5^1 lysate + 2.5fil IX Cell Lysis Buffer 1); 

■ read flie reaction microplate 15-30 minutes after addition of the 
Secondary FIUET Reaction Mix instead of the reconunended 60-90 
minutes; 



1 . Dilute lOX Cell Lysis Buffer 1 to a IX concentration with RNase-ftee H2O. 

2. Using a multichannel pipet, carefully remove flie culture medium fiom the weUs of 
adherent cells without disturbing the cell monolayer. 

3. Wash the cells once with 200nl PBS (no MgCh, no CaCU) and carefully remove the 
residual PBS with the multichannel pipet. 

4. Add 40nl IX Cell Lysis Buffer 1 per well. Lyse cells at room temperature for 3-5 
minutes. 

5. Using a multichannel pipet, carefully transfer 25^1 of each lysate sample into a 96- 
well microplate. Avoid transfening cellular material from the bottom of the well. 

6. Overlay each lysate sample with lOjil clear CHILLOXJT or mineral oil (overlaying is 
not necessary if using a heated-Ud thermal cycler). 

7. Seal microplate with Hiermaseal weU tape. Immediately heat lysates at 75-80°C for 
15 minutes in a thermal cycler or oven to inactivate cellular nucleases. 

8. During the heating step, proceed with the reaction setup. See DETAILED mRNA 
INVADER ASSAY PROTOCOL (above) for instructions. 

9. After the heat inactivation step, add the lysate samples immediately to the reaction 
microplate. Alternatively, the lysate samples can be quickly transferred to a -TO'C 
freezer for later testing (long-term stability has not been established and may diflfer 
for each cell type). 
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IV. PROCEDURAL NOTES FOR OPERATION OF THE mRNA INVADER 
ASSAY 

1 . RNA sample types and optimization of KN^A sample amount 

The assay is optimized for perfomiance mfti total KNA samples prepared 
5 from eifter tissue or cells. Several total RNA preparation methods/kits have been 

validated for performance in the mRNA INVADER assay: 

• TRIZOL (Life Technologies, Rockville, MD) 

• RNeasy (Qiagen, Valencia, CA) 

• RNA WIZ (Ambion, Austin, TX) 

10 

It is important to use a method or kit that minimizes the level of genomic 
DNA, which can inhibit signal generation. Performance of a preliminary 
experiment is recommended to determine the amount of total RNA sample 
(typically l-200ng, depending on the gene's expression level) that provides the 
1 s best limit of detection and dynamic range. 



The assay has also been validated with lysate samples from a number of 
cell types. Recommended cell densities in a 96-well tissue culture microplate are 
10,000-40,000 cells per well depending on cell type and expression level of the 
20 gene of interest. Performance of a preliminary experiment is recommended for 

any given cell line and/or gene being monitored. Such an experiment should 
mclude different cell density levels and/or dilution of the lysate samples with IX 
Cell Lysis Buffer 1 (e.g. a Ijil test level is prepared by mixing l^il lysate sample + 
4^1 IX Cell Lysis Buffer 1 for a 5^1 sample addition). 

25 

2. Dynamic range modulation: variable secondary reaction incubation times. 

The length of the secondary reaction incubation time listed in the protocol 
is sufficient for most analytes. However, the linear detection range 
(Signal/Background < 15-25) can be adjusted by reading the reaction microplate 
30 at variable tunes after addition of the secondary FRET reagents. For example, 
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10 



high expression samples can often be detected in 15-30 minutes. The Direct 
Read method pETAILED mRNA INVADER ASSAY PROTOCOL, step 7) 
enables simple optimization of the secondary reaction time as the reaction 
microplate can be incubated further if an early time read does not provide enough 
signal fiom the samples being tested. 

Monitoring llie secondary reaction fluorescence signal with time can also extend 
the dynamic range of the assay. The Direct Read method at multiple time points 
can be applied using low-cost instrumaitation. Alternatively, real-time 
fluorescence instrumentation can be used to achieve comparable dynamic ranges 
exhibited by oflier mRNA quantitation methods. 



3. Dynamic range modulation: variable sample levels. 

While the FRET detection metiiod greatly simplifies the assay, the 

15 dynamic range is typically limited to 2-3 logs when usmg an endpoint read 

method. However, since mRNA INVADER assay signal is generated linearly 
with both target level and time, the easiest method for extending the dynamic 
range beyond 3 logs (as may be required, e.g., for highly induced genes) is to 
adjust total RNA sample levels. Fold changes in gene expression (treated sample 

20 signal divided by untreated sample signal) can be reUably calculated using 

normalized sample signals. This is accomplished by testing sample levels that 
give signal within the linear detection range defined by the standard curve. For 
example, the fold induction for ahighly induced sample can be calculated as 
follows: 

25 Fold induction = (Net Signal for 1 ng treated sample X 100) / Net Signal 

for 100 ng untreated sample 
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V, TROUBLESHOOTING GUIDE 



Problem 


Possible Solution 


No signal 


• v^jucoa. ulai luQ iiuorescencc nucropidic rc<uicr iias ocwii oci up 
correctly and that the appropriate excitation and emission filters are 
in place. 

• Perform mRNA INVADER assay with the provided standard as a 
positive control. 

• Potential RNase contamination of the samples and reagents. 
Discard suspect reagents. 

• Use only reagents and oligonucleotides supplied in the kit. Do not 
mix reagents or oligonucleotides between kits. 


High variation 

between 

replicates 


• Always work with master primary and secondary reaction mixes. 

• Thoroughly mix all master mixes and samples. 

• Pipet in a similar manner across all the controls and samples. 

• Calibrate pipets firequently. 


Lack of low 
target level 
detection 


• Calibrate thermal cycler or heat block. 

• Minimize assay variability (see above), i.e. CVs are less than 5% 
for the sample replicates. This is particularly important for 
detecting low target levels. 


Lack of 
discrimination 
between high 
signal samples 


• Decrease secondary reaction incubation time to achieve detection 

within the linear range of the assay. 

• Use less total RNA per reaction. 

• Attenuate cell lysate sample signal (see NOTE, Sample and Control 

Oi^AnArafinn Part 
i^rcpaTauun) x^aii \^), 


Signal 
inhibition 


• Run samples on an agarose gel to check for presence of genomic 
DNA. Alter the RNA sample isolation method to minimize 
genomic DNA or presence of other inhibitors. The same isolation 
procedure should be used throughout an experiment. 

• If using the cell lysate format, residual PBS can be inhibitory. Be 

sure to remove residual PBS firom the tissue culture microplate. Do 
not use PBS that contains MgCla or CaCk , which inhibits the 
assay. 
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APPENDIX A: 
mKNA INVADER SINGLE ASSAY WORKSHEET 

,^PNAlNV>r>P.R ASSAY PP nCTRDURE 

■ Prepare samples and controls. 

- Prepare Primary Reaction Mix. Vortex briefly and aliquot 5^1 per well. 

- Add 5^1 sample or control per weU and pipet up and down 1-2 times. 

■ Add 1 0^1 CHILLOUT or mineral oil per well. 

• Incubate primary reaction at 60°C for 90 minutes. 

■ Prqjare Secondary FRET Reaction Mix, vortex briefly. 

- Using a multichannel pipet. aliquot 5 ^il per weU and pipet up and down 1-2 times. 

■ Incubate secondary reaction at 60»C for 60 or 90 minutes. 

- Read micioplate in fluorescence micioplate reader (FAM Dye: Ex. 485 nm/Em. 
530 nm). 



Reaction Components 


IX Volume 


^X Volume (No. of 


RNA Primary Buffer 1 


4.0)xl 




Primary Oligos 


0.25til 




Tioeo.i Buffer 


0.25^1 




CLEAVASEenzyme 






Total Mix Volume (IX) 






SFrONDARY FRET REACTION MIX 


Reaction Components 


IX Volume 


^X Volume 

rfJ^. rtf refiptifttis X 1-25) 


RNA Secondary Buffer 1 


2.0^1 




Secondary Oligos 


1.5^1 




Tioeoi Buffer 


1.5nl 




Total Mix Volume (IX) 


5.0^1 
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APPENDIX B: 
mRNA INVADER BIPLEX ASSAY WORKSHEET 
mRNA TNVADER ASSAY PROCEDURE 

■ Prq^are saiiq>les and controls. 

■ Prepare Primary Reaction Mix. Vortex briefly and aliquot 5^1 per well. 

■ Add 5\il sample or control per well and pipet up and down 1-2 times. 

■ Add 1 Ofil CHILLOUT or mineral oil per well. 

■ Incubate primary reaction at 60''C for 90 minutes. 

■ Prepare Secondary FRET Reaction Mix, vortex briefly. 

■ Using a multichannel pipet, aliquot 5^1 per well and pipet up and down 1-2 times, 

■ Incubate secondary reaction at GO^'C for 60 or 90 minutes. 

■ Read microplate in fluorescence microplate reader (FAM Dye: Ex. 485 nm/Em. 
530 nm and red dye: Ex. S60nm/Em. 620nm). 



PRIMARY REACTION MIX 



Reaction Components 


IX Volume 


X Volume 


KNA Primary Buffer 1 


4.0nl 




Primary Oligos 


0.25jil 




Housekeeping Primary Oligos 


0.25nl 




CLEAVASE DC enzyme 


0.5^1 




Total Mix Volume (IX) 


5.0(11 




SECONDARY FRET REACTION MIX 


Reaction Components 


IX Volume 


X Volume 

(#rea<:ti«n!fx1^25^ 


RNA Secondary Buffer I 


2.0^1 




Secondary Oligos 


1.5^1 




Housekeeping Secondary Oligos 


1.5nl 




Total Mix Volume (IX) 


S.0^1 
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TABLE 2 



Klenow 

Wild-Type 
S610A 
R668A 
N678A 
E710A 
E710D 
K758A 
K758R 
Y766S 
R841A 
N845A 
N845Q 
Q849A 
Q849E 
H881A 
D882N 
D882S 



Kcatfs' 

^} 
2.4 
n. d. 

0.006 
n-d. 

0.1 

1.7 
0.131 

2.0 

0.8 

0.3 

1.0 

0.03 

0.02 
0.001 

0.3 
<0.0001 
0.001 



KmfdNTP) 

2.8 

6.5 

15 
7.7 
15.6 
2.1 
6.4 
9.8 
23 
1.7 
3.8 
n. d. 
3.3 
n.d. 
7.5 



Kd(nM) 
8 

iLd. 
140, 150 
n. d. 
250 
110 



13 
40, 53 

8,5 
80,55 
100, 160 
90,91 
20,28 

30 

0.9 



Relative DNA 
affinity 
1 

0.06,0.05 

0.03 
0.07 
0.63 
1.125 
0.4,0.6 

0.2 
1.0, 1.7 
0.1,0.2 
0.08,0.05 
0.09 
0.4,0.3 
0.6 
9 



Reference Tag Pol 



2 
5 

5 
2 
2 
4 
4 

1^ 
1,2 

1.2 
1.2 

2 

2 



Wild-Type 
S515 
R573* 
N583 
E615* 
E615* 
K663 
K663 
Y671 
R746* 
N750* 
N750* 
Q754 
Q754 
H784* 
D785 
D785 



10 
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15 TABLE 3: Rational mutations in the polymerase region 
A. DNA activity table 



TthDNRXHT 


31.91 


100% 


83% 


3.81 


101.9 


TthDNRXHT 
H641A 


23.61 


74% 


62% 


5.32 


221.24 


TthDNRXHT 
R748A 


22.1 


69% 


58% 


4.39 


88.17 


TthDNRXHT 
H786A 


34.31 


108% 


90% 


7.75 


185.35 


TthDNRXHT 


32.1 


101% 


84% 


5.7 


332.8 
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H786A/G506K/ 
Q509K(AKK) 












TaqDNRXHT 
W417UG418K/ 
E507Q/H784A 
(Taq4M) 


38.23 


120% 


100% 


68.21 


1100.18 


Taq4MG504K 


36.04 


113% 


94% 


31.76 


417.40 


Taq4MH639A 


42.95 


135% 


112% 


91.46 


2249.67 


Taq4MR587A 


44.78 


140% 


117% 


143.0 


252.69 


TaqDNRXHT 

W417L/G418K/ 

G499R/A502K/ 

I503UK504N/ 

E07K/H784A 

(Taq8M) 


43.95 


138% 


115% 


122.53 


346.56 


TaqSSR677A 


32.3 


101% 


84% 


206.9 


2450.0 



B. RNA activitv table 

IiTl %Tth %Taq4M 



TfliDNRXHT 


0.89 


100% 


34% 








TthDNRXHT 


1.18 


133% 


45% 


H641A 








TthDNRXHT 


1.34 


151% 


51% 


R748A 








TthDNRXHT 


1.31 


147% 


49% 


H786A 








TfliDNRXHT 

H786A/G506K/ 


1.59 


179% 


60% 


Q509K(AKK) 








TaqDNRXHT 

W417L/G418K/ 


2.65 


298% 


100% 


E507Q/H784A 
(Taq 4M) 








Taq 4M G504K 


2.76 


310% 


114% 


Taq4MH639A 


3.89 


437% 


147% 


Taq4MR587A 


3.13 


352% 


118% 


TaqDNRXHT 
W417L/G418K/ 


4.00 


450% 


151% 


G499R/A502K/ 








I503L/K504N/ 








E07K/H784A 








(TaqSM) 








TaqSS R677A 


2.22 


249% 


84% 



5 
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TABLE 4: Rational arch mutations 
DNA activity table 



Taq4M 
P88E/P90E 


10.20 


32% 


27% 


2.00 


97.00 


Taq4M 
G80E 


26.30 


82% 


69% 


103.6 


2900 


Taq4M 
L109F/A110T 


36.45 


114% 


95% 


19.71 


749.69 



RNA activity table 



IrTl %Tth %\ 


raq4M 




Taq4M 
P88E/P90E 


Taq4M 
P88E/P90B 


0.10 


11% 


4% 


Taq4M 
G80E 


Taq4M 
G80E 


3.11 


349% 


117% 


Taq4M 
L109F/A110T 


Taq4M 
L109F/A110T 


2.45 


275% 


92% 
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TABLE 5: Arch/thumb combinations 

DNA activity table 
IdT %Tth %Taq4M HP X 



TaqW417L/ 

G418K/E507K/ 

H784A/L109F/ 

A110T/G499R/ 

A502K/I503y 

G504K/E507K/ 

T514S(TaqSS) 


63.33 


198% 


166% 


177.05 


202.32 


TaqP88E/P90E/ 

W417iyG418K/ 

G499R/A502K/ 

I503UG504K/ 

E507K/T514S/ 

H784A 


36.48 


114% 


95% 


9.44 


70.35 



15 
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RNA activity table 

IiTl %Tth %Taq4M 



TaqW417L/ 

G418K/E507K/ 

H784A/L109F/ 

A110T/G499R/ 

A502K/I503L/ 

G504K/E507B:/ 

T514S (Taq SS) 


3.16 


355% 


119% 


TaqP88E/P90E/ 

W417UG418K/ 

G499R/A502K/ 

I503I7G504K/ 

E507K/T514S/ 

H784A 


0.22 


25% 


8% 



5 TABLE 6: HeUx-hairpin-helix random mutagenesis 

DNA activity table 



IdT %Ttfa %Taq4M HP X 



TaqSS 
K198N 


23.4 


73% 


61% 


25.7 


1233.1 


TaqSS 
A205Q 


25.6 


80% 


67% 


13.4 


699.1 


TaqSS 
T204P 


11.2 


35% 


29% 


1.9 


209.4 


TaqSS 

I200M/A205G 


16.8 


53% 


44% 


7.8 


597.2 


TaqSS 
K203N 


25.9 


81% 


68% 


36.6 


1429.8 


TthDNRXHT 
H786A/P197R/K200R 


10.7 


33% 


28% 


3.2 


66.3 


TthDNRXHT 
H786A/K205Y 


11.5 


36% 


30% 


6.1 


327.5 


TthDNRXHT 
H786A/G203R 


18.3 


57% 


48% 


2.1 


98.8 
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RNA activity table 





IiTl 


%Tth %Taq4M 






TaqSS 
K198N 


1.22 


137% 


46% 






TaqSS 
A205O 


0.62 


70% 


23% 






TaqSS 
T204P 


0.36 


40% 


14% 






TaqSS 

I200M/A205G 


0.77 


87% 


29% 






TaqSS 
K203N 


2.09 


235% 


79% 






TthDNKXHT 
H786A/P197R/BaOOR 


0.47 


52% 


18% 






TthDNRXHT 
H786A/K205Y 


0.68 


77% 


26% 






TthDNRXHT 
H786A/G203R 


1.61 


180% 


61% 






TABLE 7: Random tbamb mutations 
DNA activity table 

IdT %Tth 


%Taa4M HP 


X 


TaqDNRXHT 
W417L/G418K/ 
E507K/H784A/G499R/ 
A502K/K504N/(M1-13) 


59.96 


188% 


157% 


133.65 


907.41 


TaqDNRXHT 
W417L/G418K/ 
/H784A/L500yQ5O7H 
A502K/G504K(Ml-36) 


46.74 


146% 


122% 


123.11 


822.61 


TaqDNRXHT 
W417L/G4l8KyG499R/ 
A502K/G504K/E507K/ 
H784A/r514S (M2-24) 


85.7 


269% 


224% 


369,96 


3752.12 


TaqDNRXHT 
W417UG418K/G499R/ 
A502K/G504KyE507K/ 
H784AA^518L(M2-06) 


76.7 


240% 


201% 


355.87 


2038.19 
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RNA activity table 

IiTl %Tth %Taq4M 



TaqDNRXHT 
W417L/G418K/ 
E507K/H784A/G499R/ 
A502K/BC504N/(M1-13) 


2.55 


287% 


96% 


TaqDNRXHT 
W417I7G418K/ 
/H784A/L500I/Q507H 
A502K/G504K (Ml-36) 


2.71 


304% 


102% 


TaqDNRXHT 
W417L/G418K/G499R/ 
A502K/G504K/E507K/ 
H784A/T514S (M2-24) 


4.43 


498% 


167% 


TaqDNRXHT 
W417L/G418K/G499R/ 
A502K/G504K/E507K/ 
H784AA^518L(M2-06) 


3.56 


400% 


134% 



TABLES; Chimeric mutants 
A. DNA activity table 





IdT2 


%TtiiAKK 


HP 


X 


TthAKK 


34.18 


100% 


5 


393 


Taq4MG504K 


40.19 


105% 


28 


1991 


TfiDN2M 


36.60 


106% 


289 


1326 


TSCDN2M 


25.49 


75% 


283 


2573 


TaqTthAKK 


63.89 


187% 


32 


1658 


TthTaq 4M G504K 


25.03 


73% 


8 


627 


TfiTthAKK 


34.13 


100% 


15 


459 


TscTthAKK 


35.23 


103% 


29 


2703 


TfiTaq4MG504K 


35.69 


104% 


37' 


872 


TscTaq4M 
G504K 


30.04 


88% 


25 


2008 



B. RNA activity table 

IrT3 %TthAKK 



TthAKK 


2.27 


100% 


Taq4MG504K 


2.31 


102% 


TfiDN2M 


0.20 


9% 


TscDN2M 


0.29 


13% 
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TaqTthAKK 


6.81 


300% 


TthTaq4M 
G504K 


1.09 


4<$/0 


TfiTthAKK 


1.24 


55% 


TscTthAKK 


9.65 


4.25% 


TfiTaq4MG504K 


1.05 


46% 


TscTaq 4M 
G504K 


2.95 


130% 
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All publications and patents mentioned in the above specification are herein 
incorporated by reference. Various modifications and variations of the described 
methods and systems of the invention will be apparent to those skilled in the art without 
departing fiom the scope and spirit of the invention. Although the invention has been 
described in connection with specific preferred embodiments, it should be understood 
that the invention as claimed should not be unduly limited to such specific embodiments. 
Indeed, various modifications of the described modes for carrying out the invention 
which are obvious to those skiUed in the relevant fields are intended to be within the 
scope of the following claims. 



IS 
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CLAIMS 

We claim: 

5 LA composition comprising an enzyme, wherein said enzyme comprises a 

heterologous functional domain, wherein said heterologous functional domain provides 
altered functionality in a nucleic acid cleavage assay. 

2. The composition of Claim 1, wherein said enzyme comprises a 5' 
10 nuclease. 

3. The composition of Claim 2, wherein said S* nuclease comprises a 
thermostable S' nuclease. 

15 4. The composition of Claim 1, wherein said enzyme comprises a 

polymerase. 

5. The composition of Claim 4, wherein said polymerase is altered in 
sequence relative to a naturally occurring sequence of a polymerase such that it exhibits 

20 reduced DNA synthetic activity from that of the naturally occurring polymerase. 

6. The composition of Claim 4, wherein said polymerase comprises a 
thermostable polymerase. 

25 7. The composition of Claim 6, wherein said thermostable polymerase 

comprises a polymerase from a Thermus species. 

8. The composition of Claim 7, wherein said Tliermus species is selected 
from Tliermus aguaticus, Thermus flavus^ Tliermus thermophiluSy Tliermus filiformus, 
30 and Thermus scotoductus. 
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9, The composition of Claim 1, wherein said heterologous functional domain 
comprises an amino acid sequence that provides an improved nuclease activity in said 
nucleic acid cleavage assay. 

5 10. The composition of Claim 1 , wherein said heterologous functional domain 

comprises an amino acid sequence that provides an improved substrate binding activity in 
said nucleic acid cleavage assay. 

11. The composition of Claim 1 , wherein said heterologous functional domain 
10 comprises an amino acid sequence that provides improved background specificity in said 

nucleic acid cleavage assay. 

12. The composition of Claim 1, wherein said heterologous functional domain 
comprises two or more amino acids firom a polymerase domain of a polymerase. 

15 

13. The composition of Claim 1 2, wherein at least one of said two or more 
amino acids is fit»m a pahn region of said polymerase domain. 

14. The composition of Claim 12, wherein at least one of said two or more 
20 amino acids is from a fliumb re^on of said polymerase domain. 

15. The composition of Claim 12, wherein said polymerase comprises 
Thermus thermophiltis polymerase. 

25 16. TTie composition ofClaim 12, wherein said two or more amino acids from 

said polymerase domain comprise two or more amino acids from amino acids 300-650 of 
SEQn)NO:l. 

17. The composition of Claim 1, wherein said enzyme comprises an amino 
30 add sequence selected from the group consisting of SEQ ID NOs:2-68, 341 , 346, 348, 
351. 353, 359, 365, 367, 369, 374. 376, 380, 384, 388, 392, 396, 400, 402, 406, 408. 410, 
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412, 416, 418, 420, 424, 427, 429, 432, 436, 440, 444, 446, 448, 450, 456, 460, 464, 468, 
472, 476, 482, 485, 488, 491, 494, 496, 498, 500, 502, 506, 510, 514, 518, 522, 526, 530, 
534, 538, 542, 544, 550, 553, 560, 564, 566, 568, 572, 574, 576, 578, 580, 582, 584, 586, 
588, and 590. 

1 8. The composition of Claim 1 , wherein said nucleic acid cleavage assay 
comprises cleavage of a DNA member of a substrate containing at least one RNA 
component. 

1 9. The composition of Claim 1 , wherein said nucleic acid cleavage assay 
comprises an invasive cleavage assay. 

20. A composition comprising a nucleic acid encoding the enzyme of Claim 1 . 

2 1 . The composition of Claim 20, wherein said nucleic acid is selected from 
the group consisting of SEQ E) NOs:69-135, 340, 345, 347, 350, 352, 358, 364, 366, 
368, 373, 375, 379, 383, 387, 391, 395, 399, 401, 405, 407, 409, 411, 415, 417, 419, 423, 
426, 428, 431, 435, 439, 443, 445, 447, 449, 452, 454, 455, 459, 463, 467, 471, 475, 481, 
484, 495, 497, 499, 501, 505, 509, 513, 517, 521, 525, 529, 533, 537, 541, 543, 549, 552, 
559, 563, 565, 567, 571, 573, 575, 577, 579, 581, 583, 585, 587, and 589. 

22. The composition comprising an expression vector, said expression vector 
comprising said nucleic acid of Claim 20. 

23. A composition comprising a host cell containing the expression vector of 
Claim 22. 

24. A method for producing an altered enzyme with improved functionality in 
a nucleic acid cleavage assay comprising: 

a) providing an enzyme and a nucleic acid test substrate; 
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b) introducing a heterologous functional domain into said enzyme to 

produce an altered enzyme; 

c) contacting said altered eozyme with said nucleic acid test substrate 

to produce cleavage products; and 

d) detection said cleavage products. 

25. The method of Claim 24, wherein said introducing a heterologous 
functional domain comprises mutating one or more amino acids of said enzyme. 

26. The method of Claim 24, wherein said introducing a heterologous 
functional domain into said enzyme comprises adding a functional domain from a protein 
into said enzyme. 

27. The method Qaim 26, wherein said adding a fimctional domain from a 
protein into said enzyme comprising removing a portion of said enzyme sequence prior to 
adding said functional domain of said protein. 

28. The method of Claim 24, wherein said nucleic acid test substrate 
comprises a cleavj^e structure. 

29. The method of Claim 28, wherein said cleavage structure comprises an 
KNA target nucleic add. 

30. The method of Claim 28, wherein said cleavage stnicture comprises an 
invasive cleavage structure. 

31. The method of Claim 24, wherein said enzyme comprises a 5' nuclease. 

32. The mettiod of Claim 3 1 , wherein said 5' nuclease comprises a 
themostable 5' nuclease. 
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33. The method of Claim 24, wherein said enzyme comprises a polymerase. 

34. The method of Clafan 33, wherein said polymerase is altered in sequence 
relative to a naturally occurring sequence of a polymerase such that it exhibits reduced 
DNA synthetic activity from that of the naturally occurring polymerase. 

35. The method of Claim 24, wherein said polymerase comprises a 
thermostable polymerase. 

36. The method of Claim 35, wherein said thermostable polymerase comprises 
a polymerase from a Thermus species. 

37. The method of Claim 36, wherein said Thermus species is selected from 
Thermus aquattcus, Thermus flavus, Thermus thermophilus, Thermus filiformuSy and 
Thermus scotoductus, 

38. The method of Claim 24, wherein said heterologous fimctional domam 
comprises an amino acid sequence that provides an improved nuclease activity in said 
nucleic acid cleavage assay. 

39. The method of Claim 24, wherein said heterologous fimctional domain 
comprises an amino acid sequence that provides an improved substrate binding activity in 
said nucleic acid cleavage assay. 

40. The method of Claim 24, wherein said heterologous fimctional domain 
comprises an amino acid sequence that provides improved background specificity in said 
nucleic acid cleavage assay. 

4 1 . The method of Claim 24, wherein said heterologous fimctional domain 
comprises two or more amino acids from a polymerase domain of a polymerase. 
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42. The method of Claim 41. wherein at least one of said two or more amino 
acids is firom a palm region of said polymerase domain. 

43. The method of Claim 41 , wherein at least one of said two or more amino 
acids is fiom a thumb region of said polymerase domain. 

44. The method of Claim 41, wherein said polymerase domain comprises a 
polymerase domain of a T^emws themophilus polymerase. 

45. The mefliod of Claim 41, wherein said two or amino acids from said 
polymerase domain comprise two or more amino acids from amino acids 300-650 of 
SEQroNO:!. 

46. The metiiod of Claim 24, wherein said enzyme comprises an amino acid 
sequence selected from the group consisting of SEQ ID NOs:2-68, 341, 346, 348, 351, 
353, 359, 365. 367. 369, 374. 376. 380, 384, 388, 392, 396, 400, 402, 406, 408, 410. 412, 
416. 418, 420. 424. 427, 429, 432, 436. 440, 444, 446, 448, 450, 456, 460, 464, 468, 472, 
476, 482. 485, 488. 491. 494, 496, 498. 500, 502, 506, 510, 514, 518, 522, 526, 530, 534, 
538, 542, 544. 550, 553. 560. 564, 566, 568, 572, 574, 576, 578, 580, 582, 584. 586, 588, 
and 590, 

47. The mefliod of Qaim 24, wherein said alto-ed enzyme comprises an amino 
acid sequence selected from the group consisting of SEQ ID NOs:69-135, 340, 345, 347, 
350, 352. 358. 364. 366, 368, 373. 375, 379, 383, 387. 391, 395. 399, 401, 405, 407, 409, 
411, 415, 417, 419, 423, 426, 428, 431, 435, 439, 443. 445, 447, 449, 452, 454, 455. 459. 
463, 467, 471. 475, 481. 484. 495, 497, 499, 501, 505. 509, 513. 517, 521, 525, 529. 533, 
537, 541, 543. 549, 552, 559, 563, 565, 567, 571, 573. 575. 577. 579, 581. 583, 585, 587, 
and 589. 

48. An altered enzyme produced by the method of Claim 24. 
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49. A kit comprising the altered «izyme of Claim 48. 

50. A kit comprising the composition of Claim 1 . 

5 51. The kit of Claim 50, further comprising at least one nucleic acid cleavage 

substrate. 

52. The kit of Claim 51, further comprismg at least one RNA capable of 
hybridizing to said nucleic acid cleavage substrate. 

10 

53. The kit of Claim 50, further comprising a labeled oligonucleotide. 

54. The kit of Claim 50, further comprising an invasive oligonucleotide. 

15 55. A method for cleaving a nucleic acid comprising: 

a) providing: 

i) the enzyme of Claim 1; and 

ii) a sample comprising substrate nucleic acid; and 

b) exposing said substrate nucleic acid to said enzyme. 

20 

56. The method of Claim 55, wherein said exposing said substrate nucleic acid 
to said enzyme produces at least one cleavage product. 

57. The method of Claim 56, further comprising the step of c) detecting said 
25 cleavage product. 

58. The method of Claim 55, wherein said sample comprising substrate 
nucleic acid comprises a cell lysate. 

30 59. A method for detecting the presence of a target nucleic acid comprising: 
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a) cleaving an invasive cleavage structure, said invasive 
cleavage structure comprising an RNA target nucleic 
acid; and 

b) detecting the cleav^e of said invasive cleavage structure. 

60. The method of Claim 59, wherein cleaving is cairied out by a cleavage 
agent 

61 . The method of Claim 59, wherein said target nucleic acid comprises a first 
region and a second region, said second region downstream of and contiguous to said 
first region. 

62. The mefliod of Claim 61 , wherein said invasive cleavage structure 
comprises said target nucleic acid, a first oligonucleotide, and a second oUgonucleotide, 
wherein at least aportion of said first oligonucleotide is completely complementary to 
said first region of said first target nucleic acid, and wherein said second oligonucleotide 
comprises a 3* portion and a 5* portion, wherein said 5* portion is completely 
complementary to said second region of said target nucleic acid. 

63. The method of Claim 62, wherein at least said portion of said first 
oligonucleotide is annealed to said first region of said target nucleic acid and wherein at 
least said 5' portion of said second oligonucleotide is annealed to said second region of 
said target nucleic. 

64. The method of Claim 59, wherein said cleaving generates a non-target 
cleavage product. 

65. The method of Claim 64, wherein said detecting the cleavage of said 
invasive cleavage structure comprises detecting said non-target cleavage product. 
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66. The method of Claim 62, wherein said 3* portion of said second 
oligonucleotide comprises a 3' terminal nucleotide not complementary to said target 
nucleic acid. 

5 67. The method of Claim 62, wharein said 3' portion of said second 

oligonucleotide consists of a single nucleotide not complementary to said target nucleic 
acid. 

68. The method of Claim 59, wherein said detecting the cleavage of said 
10 invasive cleavage structure comprises detection of fluorescence. 

69. The method of Claim 59, wherem said detecting the cleavage of said 
invasive cleavage structure comprises detection of mass. 

15 70. The method of Claim 59, wherein said detectmg the cleavage of said 

invasive cleavage structure comprises detection of fluorescence energy transfer. 

71. The method of Claim 59, wherein said detecting the cleavage of said 
cleavage structure comprises detection selected from the group consisting of detection of 

20 radioactivity, luminescence, phosphorescence, fluorescence polarization, and charge. 

72. The method of Claim 62, wherein said first oligonucleotide is attached to a 
solid support. 

25 73. The method of Claim 62, wherein said second oligonucleotide is attached 

to a solid support. 

74. The mefliod of Claim 60, wherein said cleavage agent comprises a 
stmcture-specific nuclease. 
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75. The method of Claim 74, wherein said structure-specific nuclease 
comprises a thermostable structure-specific nuclease. 

76. The method of Claim 60, wherein said cleavage agent comprises an 
enzyme, wherein said enzyme comprises a heterologous functional domain, wherein said 
heterologous fimctional domain provides altered fimctionality in a nucleic acid cleavage 
assay. 

77. The method of Claim 76, wherein said enzyme comprises a 5* nuclease. 

78. The method of Claun 77, wherein said 5' nuclease comprises a 
thermostable 5* nuclease. 

79. The method of Claim 76, wherein said enzyme comprises a polymerase. 

80. The method of Claim 79, wherein said polymerase is altered in sequence 
relative to a naturally occurring sequence of a polymerase such that it exhibits reduced 
DNA synthetic activity &om that of the naturally occurring polymerase. 

81. The method of Claim 79, wherein said polymerase comprises a 
thermostable polymerase. 

82. The method of Claim 81, wherein said thermostable polymerase comprises 
a polymerase from a Thermus species, 

83. The method of Claim 82, wherein said Thermus species is selected from 
Thermus aquaticus, Thermus flavus, Thermus thermophilus, Thermus filiformus, and 
Tfiermus scotoductus. 
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84. The method of Claim 76, wherein said heterologous fiinctional domain 
comprises an amino acid sequence that provides an improved nuclease activity in said 
nucleic acid cleavage assay. 

5 85. The method of Claim 76, wherein said heterologous functional domain 

comprises an amino acid sequ^ce that provides an improved substrate binding activity in 
said nucleic acid cleavage assay. 

86. The method of Claim 76, wherem said heterologous functional domain 

10 comprises an amino acid sequence that provides improved background specificity in said 
nucleic acid cleavage assay. 

87. The method of Claim 76, wherein said heterologous functional domain 
comprises two or more amino acids from a polymerase domain of a polymerase. 

15 

88. The method of Claim 87, wherein at least one of said two or more amino 
acids is fi-om a palm region of said polymerase domain. 

89. The method of Claim 87, wherein at least one of said two or more amino 
20 acids is from a thumb region of said polymerase domain. 

90. The method of Claim 87, wherein said polymerase domain comprises a 
polymerase domain of a Thennus thermophilus polymerase. 

25 91 . The method of Claim 87, wherein said two or more amino acids from said 

polymerase domain comprise two or more amino acids from amino acids 300-650 of 
SEQIDNO:!. 

92. The method of Claim 76, wherein said enzyme comprises an amino acid 
30 sequence selected from the group consisting of SEQ ID NOs:2-68, 341 , 346, 348, 351, 
353, 359, 365, 367, 369, 374, 376, 380, 384. 388, 392, 396, 400, 402, 406, 408, 410, 412, 
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416, 418, 420, 424, 427, 429, 432, 436, 440. 444. 446. 448. 450. 456, 460. 464. 468, 472, 
476. 482, 485, 488, 491, 494. 496, 498. 500. 502, 506. 510, 514. 518. 522, 526, 530. 534. 
538. 542, 544, 550, 553, 560, 564. 566, 568, 572, 574, 576. 578, 580, 582, 584. 586. 588. 
and 590. 

93. The method of Claim 76, wherein said enzyme is encoded by a nucleic 
acid selected from the group consisting of SEQ ID NOs:69-135, 340. 345. 347, 350, 352. 
358. 364, 366, 368. 373, 375. 379, 383, 387. 391. 395. 399. 401. 405. 407. 409. 411. 415. 
417, 419, 423, 426, 428. 431. 435. 439, 443. 445. 447, 449, 452. 454. 455. 459, 463, 467, 
471, 475, 481. 484. 495, 497, 499. 501, 505, 509, 513, 517. 521. 525, 529, 533. 537. 541. 
543. 549. 552, 559, 563. 565. 567, 571, 573, 575, 577, 579, 581. 583. 585, 587, and 589. 

94. The method of Claim 64, further comprising the steps of forming a second 
invasive cleavage structure comprising said non-target cleavage product and cleaving 
said second invasive cleavage structure. 

95. The method of Qaim 94, wherein said invasive cleavage structure or said 
second invasive cleavage structure comprises an oUgonucleotide comprising a sequence 
selected from the group consisting of SEQ ID NO:709-2640. 

96. The method of Claim 94, wherein said mvasive cleavage structure or said 
second invasive cleavage structure comprises an oligonucleotide comprising a sequence 
selected from the group consisting of SEQ ID NO:169-21 1 and 619-706. 

97. The method of Claun 59, wherein said RNA target nucleic acid comprises 
a cytochrome P450 RNA. 

98. The method of Claim 59, wherein said RNA target nucleic acid comprises 
a cytokine RNA. 
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99. The method of Claim 61, wherein a portion of said target nuleic acid 
consisting of said first region and said second region of said target nucleic acid comprises 
a splice junction. 

5 100. The method of Claim 61, wherein said first region or said second region of 

said target nucleic acid comprises an exon. 

101. The method of Claim 61, wherein said first region or said second region of 
said target nucleic acid comprises an intron. 

10 

1 02. The method of Claim 59, wherein said RNA target nucleic acid is 
provided in a cell lysate. 

103. The method of Claim 62, wherein said first oligonucleotide is covalently 
1 s attached to said second oligonucleotide. 

104. A kit for performing an invasive cleavage assay comprising a probe 
oligonucleotide and an invasive oligonucleotide configured to form an invasive cleavage 
structure in the presence of an RNA target nucleic acid. 

20 

1 05. The kit of Claim 1 04, wherein said kit comprises an oligonucleotide 
comprising a sequence selected from the group consisting of SEQ ID NO:709-2640. 

106. The kit of Claim 104, wherein said kit comprises an oligonucleotide 

25 comprising a sequence selected fi-om the group consisting of SEQ ID NO:169-211 and 
619-706. 

1 07. A method for detecting the presence of two or more target nucleic acid 
sequences comprising: 

30 a) cleaving two or more invasive cleavage structures, each of said two or 

more invasive cleavage structures comprising an RNA target nucleic 
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acid havmg a target sequence, wherein each of said invasive cleavage 
structures comprises a different RNA target sequence; and 
b) detecting the cleavage of said invasive cleavage structures. 

5 



299 



wo 01/90337 



PCT/USOl/17086 




FIGURE 1 



1/145 



wo 01/90337 



FIGURE 2 



PCT/USOl/17086 



5- Y Probe 3. 
A Invader Oligonucleotide ^'^fesftAfiGGAAGGGAAGAAAGCGAA 

5'acggggaaagccggcgmcgtggcgasam 

.CGMCUGCa:CUUUCX3GCCGCUUGCACCGCUCUUU0CUUCCCUUCUUU0^ 

RNA Target Strand 



5' Y Probe 3, 
B Invader oligonucleotide ^'^asaafesAAGGGAAGAMGCGAA 

5'A0GGGGAAAG0CGGCGAAC6TCGCX3AeMC 
-CGMCUGCCOJUUUCGQCXXaaJUGCACCGaJCUUUCCUUCCCUUCUUUO^ 

RNA Target Strand 

5' 




3' 



.S'y Probe 3, 
C Invader oHgonucleoUde '^'>)teAAAfiGAAGGGW«3AMGCGAA 
S'cttgacggggaaagccggcgaacgtggcgC 

-CGMCUGCCCCUUUCX3GCXmJUGCA(XX3CUCUUUCC^^ 



RNA Target strand 

5' 



Y Probe 3. 

D Invader oligonucleotide 5'/Jgamggaagggaagaaagcgaa 

6'cttgacggggaaagocggggaacgtggcgP 
-osmcugccccuuucggccqcuugcacxgcucuuuccuucx^cuucu^ 

RNA Target strand 

5' 

3' 
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DNA 

TaqExoTaqPolTthPol Pfu Afu Mja With NE 

Uncleaved ^js^t ggjj^jjMSkiL^^igi^ ^i^^ 
probB ^^^W ^^HT ^^Bff ^^^f ^PST 



Cleaved _ _ ^ 



1 2 3 4 5 6 7 8 

g RNA 

TaqExoTaqPolTthPol Pfu Afu Mja Mth NE 

Undeaved 
probe 



Cleavede 
products 




1 2 3 4 5 6 7 8 



FIGURE 11 



20/145 



wo 01/90337 



PCTAJSOl/17086 



TaqPol 
TthPol 
TaqTth(N) 
TthTaq(N) 
TaqTth(B-S) 
TaqTth(N-B) 
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TaqTth(D-B) 
TaqTth(Bs-B) 




0.5 1 
cycling rate, mln'"' 
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FIGURE 12 
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TthPol 
TaqTfiKD-B) 
TaqTth(D-B) F413H/A414R 

TaqTlh{D-B)W417iyG418K 
TaqTth(D-B)A439R 
TaqTth(N-D) 
TaqTth(N-D)V463L 
TaqTth(N-D)G499R 
TaqTth(N-D)E507Q 
TaqTth(N-D)Y535H 




cycling rate, min' 



FIGURE 14 
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FIGURE 15 
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Nuclease 
Domain 



Tth 



Polymerase 
Domain 



Polymerase Activity Assays 

% Pl-laheled dllTP in«irporated 
RNA.prA^ or DNA. pfdA^ Template 



5.8 (1,00) 



14.8(1.00) 



Taq 



0.8(0.14) 



15.0(1.01) 



Taqm(N) 



4.88(0.84) 12.9(0.87) 



TaqTth(N-B) 



0.58 (0.10) 133 (0.90) 



TaqTtti(B-S) 



Taq(W417iyG418K/ES07Q) 



6.60(1.14) 14.9(1.01) 



0.42(0.07) 12.6(0.85) 



FIGURE 16 
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FIGURE 17 
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(SEQ1DN0:223) 
(SEQrDNO:224) ^, cy3 | 

▼ v^^GCT TCTCGCTCGC 

ACGGAACGAGCGTCTTTG 
UGCCUUGCUCGCAGAAAGCGACAGAGCGAGCG 

(SEQn)NO:225) 



FIGURE 18A 
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(SEQn>NO:223) 
(SEQIDNO:224) | 

^ ^^Gcl VcTCGCTCGC 

ACGGAACGAGCGTCTTTG 
TGCCTTGCTCGCAGAAAGCGACAGAGCGAGCG 

♦ 

(SEQIDNO:226) 



FIGURE 18B 
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(SEQIDNO:223) 
(SEQIDNO:226) ^ cy3 | 

▼ ^GCT TCTCGCTCGC 

TGCCTTGCTCGCAGAAAGCGACAGAGCGAGCG 
ACGGAACGAGCGTCTTT Qa », 

(SEQIDNO:227) 



FIGURE 18C 
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(SEQIDNO:223) 

<^GCT TCTCGCTCGC 
TGCCTTGCTCGCAGAJ^GCGACAGAGCGAGCG 

(SEQIDNO:226) 



FIGURE 18D 
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Poljmerase 
Domain 




Turnover Rate (l/min> (Rel ative Rate^ 



IL-6 RNA Synthetic r25mer 
Invader Assav Invader Assay 



TaqPoI 



0.86 
(1.00) 



0.29 
(0.32) 



0.29 
(1.00) 



0.03 
(0.10) 



SyntheticIrTl 
Invader Assay 

1.85 

(1.00) 



0.05 
(0.03) 



TaqTth^ 



TthTaqCN) 

r— 



TaqTth(B-S) 



TthTaqCB-S) 



Taqm(N.B) 



TthTaq(N-B) 

czz 



TaqTth(N-Bs) 



I 



TaqTth(Bs-B) 

I 

TaqTth(N-D) 

I 



I 



TaqTth(D-B) 



0.86 
(1.00) 

0.33 
(0.38) 

0.57 
(0.67) 

0.70 
(0.79) 

1.41 
(1.59) 

0.22 
(0.25) 

0.22 
(0.25) 

0.89 
(1.04) 

0.33 
(0.38) 

0.32 
(0.42) 



0.45 
(1.56) 

0.03 
(0.10) 

0.07 
(0.23) 

0.30 
(1.05) 

0.40 
(1.38) 

0.05 
(0.18) 

0.10 
(0.11) 

0.18 
(0.63) 

0.08 
(6.29) 

0.16 
(0.54) 



3.36 
(1.81) 

0.00 
(0.00) 

0.15 
(0.08) 

1.70 
(0.92) 

3.22 
(1.74) 

0.05 
(0.03) 

0.06 
(0.03) 

0.71 
(0.38) 

0.18 
(0.10) 

0.16 
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FIGURE 19 
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FIGURE 21 



s, 

-^CAACTGCCGTGA 
^^CGAGGCGCACG 
G GCTCCGCGTGGTTGACGGCACT 



"^CAACTGCCGTGA 
AcGAGGCGCACG 

GGCUCCGCGUGGUUGACGGCACU-BiotinSA-5 
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FIGURE 22 



(SEQIDNO:230) 

3 ■NH4-AATTGCTCCGCGTGGTTGACGAAGGAGGC-5 ' 

^<^CAACTGCTTCCTCCG-3 ' 

<fV (SBQIDNO:231) 
^^^^ 



B (SEQIDNO:232) ^^<^Cp 

,C1 ^ 



I -f^^ (SEQIDNO230) 

5 ' -AACGAGGCGCACC''^ 1 



3 'NH4-AATTGCTCCGCGTGGTTGACGAAGGAGGC-5 ' 

-<^CAACTGCTTCCTCCG-3 ' 



^V*^ (SEQIDNO:231) 
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FIGURE 23 



(SEQ1DN0:235) (SEQIDNO:234) 

♦ \ i 

5 ' -ACGAGCGTCTTT^ "^CGCTGTCTCGCT 

TTGCTCGCAGAAA GCGACAGAGCGA-Fl-5 ' 

4 

(SEQIDNO:233) 
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FIGURE 24 



(SEQIDNO:167) 

•^CAACTGCCGTGA 
^CGCACG 
^gGCGUGGUUGACGGCACU-5 ' 

(SEQIDNO:228) 



B (SEQIDNO:167) 



•^caactgccgtga 
acgcacg 
^r.gcgtggttgacggcact-5 ' 

(SEQroNO:229) 
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FIGURE 25 
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FIGURE 26 



1 89-15-1 3' 

^^GAAAGGAAGGGAAGAAAGCGAA 

CGAACTGCCCCmCGGCCGCTTGCACCGCTCmCCTTCCCTTCTTTCGCTT- 

M13mp18 

3* 5' 

"^^1 89-15-1 3' 

81-69-5 ^AGAAAGGAAGGGAAGAAAGCGAA 




^bcTTGACGGGGAAAGCCGGCGAACGTGGCG 



CGAACTGCCCCmCGGCCGCTTGCACCGCTCTTTCCTTCCCTTCTTTCGCTT- 

M13mp18 



3' 5' 



5' 



^;.T 89-15-1 3. 



S 



81-69-4 '^AGAAAGGAAGGGAAGAAAGCGAA" 
CTTGACGGGGAAAGCCGGCGAACGTGGCGA 
CGAACTGCCCCmCGGCCGCTTGCACXJGCTCmCCTTCCCTTCTTTCGCTT-- 

M13mp18 

3' 5* 

Y 89-15-1 3- 

81-69-3 '^6S^6GAAGGGAAGAAAGCGAA 

^ TGACGGGGAAAGCCGGCGAACGTGGCGAS& 

CGAACTGCCCCmCGGCCGCTTGCACCGCTCTTTCCTTCCCTTCTTrCGCTT- 

M13mp18 




5' 



^1 Y 89-15-1 3* 

8 1 -69-2 '^^^SAMGGAAGGGAAGAAAGCGAA 

^ ACGGGGAAAGCCGGCGAACGTGGCG AGAAA 

-CGAACTGCCCCmCGGCCGCTTGCACCGCTCmCCTTCCCTTCTTTCGCTT- 

M13mp18 

3' 5* 
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FIGURE 27 



Size (nt) 



6> 
4> 



Structure: a 



j( II II I 
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FIGURE 29 



1 2 3 4 5 6 7 8 

"~ - ^1: uncut 



^2: uncut 



^ ^2:cut 
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FIGURE 34 



1 23456789 10 



A 




11 12 13 14 15 16 17 18 19 20 



B 




21 22 23 24 25 26 27 28 29 30 



c 




46/145 



wo 01/90337 



PCT/USOl/17086 




47/145 




48/145 



wo 01/90337 



PCT/USOl/17086 




49/145 



wo 01/90337 



PCT/USOl/17086 



FIGURE 36A 



1 2 3 4 5 6 7 8 
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Figure 39 



Human Ubiquitin (4506712) mRNA Slngle^tranded 
Analysis 



120% 
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Nucleotide Position 
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SEQUENCE LISTING 
<110> Third Wave Technologies 
<120> Detection of RNA 
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<160> 708 

<170> Patentin version 3.0 
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<211> 834 

<212> PRT 

<213> Thermus aquaticus 
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Met «. XI. H.t «. «o Leu oxu «<. oly «9 «! 
vax «p .ly Hi, «e -eu Me xyr Thr Phe Phc «. '■V 

20 2^ 
.eu Thr Thr Ser Arg Gly Olu Pro Val Oln Ala Val Tyr Oly Phe Ala 
35 *° 



.v. ser Le. Lye «a Leu Ly. Glu «P =ly "'^ 

50 5^ 
val val Pl^e Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 
70 

' 1 Tyr Lye «. «ly -> ^ "° ^ 

85 

Leu «. L.U n= Ly. =lu Leu v.1 «P Leu Leu .ly P.. Thj Leu 

100 

Olu v.1 Pxo Oly Tyr 31u .1. »P A.P v.1 L.. Thr Leu «a Ly. 
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Ly, ». «u Ly. al. Oly Tyr Glu v.1 «:g II. Leu Thr Al. ».P »9 

130 

Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His Pro Glu 
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lieu Lyg Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu Glu Asp 
225 230 235 240 

Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly Leu Arg 
260 265 270 

Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly 
275 280 285 

Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro Pro Pro 
290 295 300 

Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro Met Trp 
305 310 315 320 

Ala Glu Leu Lys Ala Leu Ala Ala. Cys Arg Asp Gly Arg Val' His Arg 
325 330 335 

Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly 
340 345 350 

Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp 

355 360 365 

Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro 
370 375 380 

Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp 
385 390 395 400 

Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg 
405 410 415 

Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr 
420 425 430 

His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala 
435 440 445 

Thr Gly Val Arg Leu Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu 
450 455 460 

Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala 
465 470 475 480 

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 
485 490 495 



Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys Thr Gly 
500 505 510 
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530 
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545 
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580 
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610 
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625 
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660 
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' 680 



675 



685 



OXU OXU «. vax ^. Ph. n. OXU ^ Tyr Ph. OX. S.r Ph. Pro Ly. 

iron 695 



690 



val Ala Trp He Olu .ys Thr Leu olu Olu oly Arg .ys Arg Oly 
705 "^^O 

val Glu Thr .eu Phe Oly Arg Arg Tyr Val Pro Asp .eu Asn 
725 

«a vax .y. S.r V.X OXu «a «a OXU «.t «, Ph. «. 



740 



745 



750 



P.0 vax OX. oxy ^r «. Ma ^ M.t .y. ... "a M.t vaX 

755 



P.e pro Arg Leu Arg olu Met Oly Ala Arg Met Leu Oln 

775 

val His ASP Olu Leu Leu Leu Olu Ala Pro Oln Ala Arg Ala Olu Olu 



Lys Leu 
770 



785 



790 



vax M. Ala L.U .i. .y. OXU Ma ».t OXu .y. «a Tyr Pro ... M. 

805 "-^^ 



3 



wo 01/90337 



PCT/USOl/17086 



Val Pro Leu Glu Val Glu Val Qly Met Gly Glu Asp Trp Leu Ser Ala 
820 825 830 



Lys Gly 



<210> 2 

<211> 839 

<212> PRT 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 2 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 

35 40 45 



Ala Lys Ser Leu Leu Lys Ala Leu 
50 55 

Val Val Phe Asp Ala Lys Ala Pro 
65 70 

Gly Tyr Lys Ala Gly Arg Ala Pro 
85 

Leu Ala Leu lie Lys Glu Leu Val 
100 



Lys Glu Asp Gly Asp Ala Val He 
60 

Ser Phe Arg His Glu Ala Tyr Gly 
75 ' 80 

Thr Pro Glu Asp Phe Pro Arg Gin 
90 95 

Asp Leu Leu Gly Leu Ala Arg Leu 
105 110 



Glu Val Pro Gly Tyr Glu Ala Asp 
115 120 

Lys Ala Glu Lys Glu Gly Tyr Glu 
130 135 

Asp Leu Tyr Gin Leu Leu Ser Asp 

145 * 150 

Gly Tyr Leu He Thr Pro Ala Trp 
165 

Pro Asp Gin Trp Ala Asp Tyr Arg 
180 

Asn Leu Pro Gly Val Lys Gly He 
195 200 

Leu Glu Glu Trp Gly Ser Leu Glu 



Asp Val Leu Ala Ser Leu Ala Lys 
125 

Val Arg He Leu Thr Ala Asp Lys 
140 

Arg He His Val Leu His Pro Glu 

155 160 

Leu Trp Glu Lys Tyr Gly Leu Arg 
170 175 

Ala Leu Thr Gly Asp Glu Ser Asp 
185 190 

Gly Glu Lys Thr Ala Arg Lys Leu 
205 

Ala Leu Leu Lys Asn Leu Asp Arg 



4 
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^„ Ly. .ro «a U. ^ olu Ly. U. I^u M. HI. H« «P «P 

225 

L.» ser -V' ^ ~ ^ 

245 

v,l »P PKe .y. «9 «9 pro «P Olu Le» «g «a 

260 265 



riv ser Leu Leu His Olu Phe Gly Leu 
Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu i.e 
275 280 

Olu s.r P~ Ly= «. "° ^ "° ^" "° 

290 295 
Oly »U P« V.1 Oly V.I L,« se. -9 ^V- - f,? 

305 

^ ^ L.U Ala ^^u Ma «. «a Oly Oly ^ V.I Hi. «| "a 

^ P„ »a Oly .y. -P - 51? 

340 ^'^^ 

7vT=. Q^^-r Ara Glu Gly Leu Asp Leu 
Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg 

355 

,al pro Oly MP ..P P~ Hat ^ 

370 -^^^ 

Th, Thr Pxo Olu Oly .al Ala «9 «9 ^ ^ 
385 "» 

Olu ASP Al. Ala Hi. A., Ala ..u Leu Saj Olu AT, Leu HU A.| Aa. 

405 

:.u Leu .y. AT, I<u Olu Oly Olu Olu .y. :<u :.u T.P Tyr Hla 

420 

tf^n T«,i ala His Met Glu Ala Thr 
Glu val Glu Lys Pro Leu Ser Arg Val Leu Ala Hxs Me^ 

435 

Tifir, val Ala Tyr Leu Gin Ala Leu ser Leu Glu Leu 
Gly Val Arg Arg Asp vai Axa xy^ 

450 

Ma Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg I^u Ala Oly 
465 

His pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 

485 .^^^ 
„p Olu cau A« Lau P„ Ala .au Oly Lya T.. Olu .ya thj Oly Lya 
500 
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515 



520 



525 



lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 

595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser , 
610 615 620 

Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys Asp lie His 
625 630 635 640 

Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 

770 775 780 

His Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val 
785 790 795 800 

Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 
805 810 815 



Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 
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825 

820 

Gly His His His His His His 
835 

<210> 3 
<211> 842 
<212> PRT 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 3 

«. S.r OXU «a H« ..u pro PJe «u P» 0» ^ V.l 
1 5 



^. v.. aly Hi. Hi, ..u 

20 

oi, «» - f/ "° fs- ^' 

P^ «. ly. s» «u .V. - fj> 

50 " 
v,i val va Phe «P -V^ 

0i« M. T,. ^V. ». OIV r "° 

' 85 

^ 0i„ «u «a ne .y. Olu ..u vai MP Le. «u OXy P.. 
^ .eu 3iu P~ «y Ty. 01. "a «P «P vai IJ. -to 

115 -^^^ 



... .y. Ly. «a Oiu .y. «u O.y Tyr OX. val U, ... «>r Ma 

130 

7v-r-« \7ai Ala Val Leu. His 
ASP Arg ASP Leu Tyr Gin Leu Val Ser Asp Arg Val Ala 
145 150 

P„ .xu OXy Hi= L.« XXa Tte Pro OX. T.p ... «P OX. .y= Tyr OXy 
165 -^'^ 
^ P,o ox. OX. TTP vaX «P P.. - vaX OXy ».P Pro 



180 

s.. «p «n L.. - oXy vaX Ly. OXy IX. OXy ox. .y. ^ Ma ... 

195 

. Glu Trp Gly ser Leu Glu Asn Leu Leu Lys Asn Leu 

Lys Leu Leu Lys Glu Trp 'aiy ocx 

210 
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Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala 

340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly 
355 360 365 

Leu Gly Leu Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu 
405 410 415 

Phe Ala Asn Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp 
420 425 430 

Leu Tyr Arg Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser 

450 455 460 

Leu Glu Val Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys 
500 505 510 



Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 
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«. Hi. pro n. V.1 01. Ly. lie I.. OX" ^„ '''' 

530 535 

^ s.. Thr ^ U. X.P «o .eu Wo «P ^« XI. Hi. Pro «, 

545 550 

«v, 7io« rin Thr Ala Thr Ala Thr Gly 
Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala . 

565 

;^ s.r ser ser «P Pro "« ^ ™' 

580 

^ C.V 31n lU «S P.e n. Ola Olu oly Trp 

595 600 
.eu Leu Val Ala .eu ^.sp Tyr Ser Oln Xle Glu Leu Val .eu 
610 

Hia Ueu ser Gly Asp Glu Asn Leu Xle Arg Val Phe Gin Glu Gly Arg 
625 "0 

Xle His Thr Glu Thr Ala Ser Trp Met Phe Oly Val Pro Are Glu 
645 

Ma v.! MP pro LO. Hee ^ Ar. .1. Ly, Thr He P-- «V 
660 

val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala Xle 
675 

pro xyr 01. 01. 01. ^ ^ 01. ^ Tyr Pb. ol„ ser Phe 

690 

n cn Tio nn Lvs Thr lieu Glu Glu Gly Arg Arg 
Pro Lys val Arg Ala Trp He Glu Lys mr u^^ ^^o 

705 '^^^ 

Oly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 

725 ^-^^ 
01. Ma val -y. s.r val 01. «. "a 01. «| Het M. 

740 

Phe ». H.t pro val 01. oly Thr M. M. A.P L.. Nat Lya La. M. 

755 ■ 
„« val Ly. L», Pha pro La. 01. 01. -at oly «9 «.t ... 

770 

x,eu Gin val His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala 

790 

Til Al. val Ala AT, La. Al. hy. olu Val Met 01. Oly val Tyr Pro 



820 



805 

^. Ala val pro La. 01. val 01. val oly 11. Oly 01. «P Trp La. 
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Ser Ala Lys Glu His His His His His His 
835 840 

<210> 4 
<211> 839 
<212> PRT 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 4 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 

85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie j Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 
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Lys Leu ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 



245 



val ASP Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 



275 280 
Leu Glu ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 



290 



295 



Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
310 

Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 



305 



Asp Leu Leu 



325 330 



335 



Ala ASP pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 



340 



345 



Leu 



Ala Lys ASP Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 



355 



360 



val pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 

380 



370 



375 



Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 



395 



385 390 

Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 

4X5 



Glu Asp 



405 410 
Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 



420 425 
Glu val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 



Gly 



435 440 
val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 



460 



450 455 
Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 



465 



470 



475 



His pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 



485 



490 



Asp 



Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys Thr Gly Lys 



500 



505 



Arg ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 

^ 525 



515 



520 



lie val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 



530 



535 



540 
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Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg Asp lie His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met. Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 • 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 

805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 



Glu His His His His His His 
835 
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<210> 5 

<211> 839 

<212> PRT 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 5 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
1 5 10 1^ 



Leu 



val ASP Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 



Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

val val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 

85 3° 
Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 



100 



105 "0 



Glu val pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 



115 120 125 

Lys Ala . . ^^(j 



Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 

ASP Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
X45 150 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 

Pro ASP Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 

Asn Leu pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 



Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 

Lys Leu ser Trp Asp Leu Ala Lys Val Arg' Thr Asp Leu Pro Leu Glu 
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245 



250 



255 



Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp LeT4 Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 



Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
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ccc 560 
545 550 SSS 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

ser ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 

Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 



.595 



600 



Ala Leu ASP Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly ASP Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys Asp He His 
625 630 

Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
705 710 715 



val Glu 



Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 



725 



730 735 



Arg val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val 
785 790 795 

Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 



805 



810 



Pro Leu 



Glu val Glu val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 



820 825 



830 



Gly His His His His His His 
835 

<210> 6 
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<211> 839 
<212> PRT 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 6 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 

180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 
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val ASP Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 

Phe Leu Glu Arg Leu Glu Phe Qly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 



290 



295 



300 



Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 

ASP Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 

Ala ASP pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 

val pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp ^ 
385 390 395 

Glu ASP Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 

420 

Glu val Glu Lys pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 4*5 

Gly val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 



Ala 
465 



450 455 

Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 



470 



475 



His pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 

490 



485 



Asp 



Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 



530 



535 



540 



Thr Tyr He Asp Pro Leu Pro Asp- Leu He His Pro Arg Thr Gly Arg 



545 
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Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg Asp lie His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Tip Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 7 
<211> 839 
<212> PRT 
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<213> Artificial 
<220> 

<223> Synthetic 
<400> 7 

Met Asn Ser Qly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
1 5 10 15 

Leu val ASP Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 



20 25 30 



Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 

35 40 " 

Ala Lys ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 

50 55 60 

val val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Qly 
65 70 75 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Qln 



85 



Leu 



Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 



100 



105 



Glu val pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 

130 135 "° 

ASP Leu Tyr Gin Leu Leu Ser Asp Arg He His val Leu His Pro Glu 



145 



150 



Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 



165 



pro ASP Qln Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 



195 



200 



Leu Glu Glu Trp Qly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 . 220 



Leu Lys pro Ala lie Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 

Lys Leu ser Trp Asp Leu Ala Lys val Arg Thr Asp Leu Pro Leu Glu 

- . - 250 ^ 



245 



val ASP Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 

260 265 
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Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 

355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 

530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 



Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 
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Ser ser Asp Pro Asn Leu Qln Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 

Ala Leu ASP Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 



610 



615 



Gly ASP Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
630 635 



625 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 

Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 



Gly Met Ser 
675 

Glu Ala Gin 
690 



680 685 
Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 



695 



700 



Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 

val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 



740 



745 



Pro val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 



805 



810 



Pro Leu 



Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 

83 0 



820 



825 



Glu His His His His His His 
835 



<210> 8 

<211> 839 

<212> PRT 

<213> T^tificial 
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<220> 

<223> Synthetic 
<400> 8 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 

210 • 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
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oflo 285 
275 280 



I.e« Gl« Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Qlu 

290 295 
Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lye Glu Pro Met Trp Ala 
305 3" 

Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 ^-^^ 

Pro Glu pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 

355 

pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 



370 



375 



Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 



385 



390 



Glu ASP Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 

Glu val Glu Lys pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 



435 



440 



Gly val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu ser Leu Glu Leu 



450 455 



Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 

ASP Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys Thr Gly Lys 

500 505 

Arg ser Thr Ser Aia Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 

515 520 
lie val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 

530 535 
Thr ryr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg «xr Gly Arg 
545 550 555 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 

565 570 

ser ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
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580 585 590 

Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 

660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Tirp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 9 
<211> 839 
<212> PRT 
<213> Artificial 

<220> 

<223> Synthetic 
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<400> 9 

Met Asn Ser Oly Met .eu Pro Leu Phe Glu Pro .ys Gly Are Val Leu 

L val ASP Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 

20 25 
Oly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
3S *° 
Lys ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 5^ 

, r Tvi^ T^^r^ Qpr Phe Arq His Glu Ala Tyr Gly 
val val Phe Asp Ala Lys Ala Pro Ser Phe Arg n 

65 '^^ 

Oly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 

Ala Leu Xle Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 

100 

alu val pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 



x.ys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 

Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 

145 

Oly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 

pro ASP Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 



180 "5 



.sn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 

.eu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 

210 215 
.eu Lys pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 

.ys Leu ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 

245 250 



val ASP Phe Ala Lys Arg Arg Glu Pro Asp Arg Olu Arg Leu Arg Ala 

260 265 

P.e Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 



275 280 
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Leu Glu Ser Pro Lys Ala Leu Glu Qlu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 

450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 

515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 



565 



570 



575 



Ser 



Ser 



Asp Pro Asn 
580 



Leu 



Gin Asn He Pro 
585 



Val 



Arg 



Thr Pro Leu 
590 



Gly 
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. ni» Phe lie Ala Glu Glu Gly Trp I>eu Leu Val 
Gin Arg He Arg Arg Ala Phe lie Aia 
595 

^ ^ «P ^ - 01„ n. 01» .eu ^ vax «. Hi= - - 
610 

«P ». «u no ^ V.1 Phe oiu 01, "P »i= 

625 ^"^^ 

Olu ™. S« »" ^ "° 

645 

Ala LVB Thr lie ASH Phe Gly Val Leu Tyr 
Pro Leu Met Arg Arg Ala Ala Lys inr ^j. 
660 

M« se. «. Hi. - OX. 01» .e. - ^ 

675 

Ti« ftiu &ra Tvr Phe Gin Ser Phe Pro Lys Val 
Glu Ala Gin Ala Phe He Glu Arg Tyr 

690 

^ «. ne oi» .yj T>,r oiu oi, «a «9 oiy ^ 

Tl 01« Leu .he Oly ^ val pro «P Le« olu 

725 

^ v.1 .ys ser val 01« ^. ^l- olu ^ "-^ ^ 

740 

p„ vel 01„ Oly Thr «. «• - «et .ye .e« «a He. val .ye 
755 

.e» Phe pro 01» olu H.t oly «e «s Met l.u 01. val 

HI. Zl Olu X.U val ..u 01. «. pro Lye 01. »9 -J 
785 

;aa Ku Ma Lye olu Val Met olu oly val Tyr Pro «» Ma val 

805 

^ «u Olu v.1 Olu val Oly Xle oly olu «P «P - ser M. Lye 

Glu His His His His His His 





835 


<210> 


10 


<211> 


842 


<212> 


PRT 


<213> 


Artificial 


<220> 


Synthetic 


<223> 


<400> 


10 
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Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala. Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 



Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 
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^ ^ a., o., ... v.. 0» VU .eu s„ - oxu pro 

305 

o.„ ^ - - - s; S 

325 

^ „a «P - - - -V - ^V. »P - ^V. - 

340 

. " T-n Ma val Leu Ala Ser Arg Glu Gly 
arg Gly Leu Leu Ala Lys Asp Leu Ala L 
355 



,al «o Oly «P «P "o ^ 

370 

P„ s« «n Thr pro .ly «S Ty- f.? 

390 

r» ^ «.r axu ..P «. HX. - - - - S 

405 

. T alu Gly Glu Glu Lys Leu Leu Trp 
His Arg Asn Leu Leu Lys Arg Leu Glu Gly o 
420 

T, T»« ser Arg Val Leu Ala His Met 
^eu xyr His Glu Val Glu Lys Pro Leu Ser Arg 

435 

, » tr=>i Ala Tvr Leu Gin Ala Leu Ser 
Glu Ala Thr Gly val Arg Arg Asp val Ala Tyr 

450 

^ oxu LOU oxu .lu XX. «e - - - Tsl 
465 

»»r, «!*.r Ara ASP Gin Leu Glu Arg 
Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg A p ^ 
485 

va ..u ... -p =1. - - - - r. 

500 ^""^ 

« AT;, Ma val Leu Glu Ala Leu Arg Glu 
Thr Gly Lys Arg Ser Thr Ser Ala Ala Val 

515 

n^r^ wia Ara Glu Leu Thr Lys 
;aa His pro lie Val Glu Lys He Leu Gin Hxs Arg 
530 "5 

.u .y. «» Ty. V.X «p P~ - - - - %l 

545 

™. .xy .3 ..u HXS ... ^ - - " - 

565 

^ „u s« - »»P P.O ». - S '"^ 

580 

Ala Phe He Ala Glu Glu Gly Trp 

600 



pro Leu Gly Gin Arg He Arg Arg Ala .ne ... 



595 
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Leu Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg 
625 630 635 640 

Asp He His Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Glu Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 

755 760 765 

Met Val Lya Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala 
785 790 795 800 

Glu Ala Val Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Glu His His His His His His 
835 840 



<210> 


11 


<211> 


842 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


11 



Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
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10 15 



.eu I^u Val ASP Qly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 

20 25 
.ys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Oln Ala val Tyr Gly 

35 ^° 
Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 

50 55 «0 

val Phe val Val Phe Asp Ala Lys Ala Pro ser Phe Arg His Glu Ala 
65 70 ''S 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 

85 5" 
;^g Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 

I.eu Glu val pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 ^20 
Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 



130 



135 



ASP Arg ASP Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 



145 150 



pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 



ser ASP Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 

195 200 
Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 

210 215 
ASP Arg val Lys Pro Glu Asn val Arg Glu Lys He Lys Ala His Leu 
225 230 

Glu ASP Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 

pro Leu Glu val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 

260 265 
Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 



Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 

290 295 
pro pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
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'305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly 
355 360 365 

Leu Gly Leu Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 



Glu Trp Thr Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu 
405 410 415 

Phe Ala Asn Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp 
420 425 430 

Leu Tyr Arg Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser 

450 455 460 

Leu Glu Val Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Glu Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 



Ala His Pro He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys 
530 535 540 



Leu Lys Ser Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr TQa Thr Ala Thr Gly 

565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
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610 



615 



620 



His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Oln Glu Gly Lys 
625 • 

ASP lie His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Olu 

645 

Ala val ASP pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 

val Leu ryr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 

pro ^r Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe 



690 

pro Lys val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 



755 



Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 

X.U Asn Ala Arg Val Lys ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 

760 

Met val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 

770 '^'^^ 
I^u Gin val His Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 ''SO 

Glu Glu val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 

805 S-*'^ 
.eu Ala val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 



820 ®25 



Ser Ala Lys Gly His His His His His His 

840 





835 


<210> 


12 


<211> 


833 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


12 



Met Asn ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 

- c 10 
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Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala* Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 



Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 
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ASP Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 

325 

Ala ASP pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 

.eu Ala Lys Asp Leu Ala val Leu Ala Ser Arg Glu Gly Leu Asp Leu 



355 



360 



val pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 



370 



375 



Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 



Glu ASP Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 

405 ^■^^ 

.eu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 

420 ^^^^ 
Glu val Glu Lys pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 

435 4*° 
Gly val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 

Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 ' 

ASP Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys Thr Gly Lys 

500 505 
Arg ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 



515 520 



He 



val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 

540 



535 



545 



530 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 



550 



Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 



565 



570 



ser ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 



580 



585 



Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 
595 600 

Ala Leu ASP Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 



610 



615 
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Gly Asp Glu Asn Leu lie Arg Val phe Gin Glu Gly Lys Asp lie His 
625 630 635 640 

Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 

725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asp Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val 
785 790 795 800 

Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 



Gly 



<210> 


13 


<211> 


833 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


13 



Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 

20 25 30 
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Oly Leu Thr Thr Ser Arg 3ly Olu Pro Val Gin Ala Val Tyr Gly Phe 

Ala Lys ser .Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 

50 55 
val val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 " 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 

Glu val pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 

115 120 
Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 

130 

ASP Leu Tyr Gin Leu Leu ser Asp Arg He His Val Leu His Pro Glu 



145 



150 



Gly Tyr Leu lie 



Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 



165 



170 



pro ASP Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 

185 



180 



Asn Leu Pro 
195 



Gly val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 



200 



Glu Trp Gly ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 



215 



Leu Glu 
210 

Leu Lys pro Ala lie Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 

Lys Leu ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 

val ASP Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 

260 265 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 

275 280 
Leu Glu ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 

290 295 
Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 

ASP Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 



325 
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Ala Asp Pro Leu Ala Gly hen Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Net Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 

485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 



Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 
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^ «„ «. S« TTP M« .he =1, VU P» «S 01» 

645 



H« "a «. n. oiy V.1 ^ 

660 

01, „.t s« Hi. s.r ox. ax» n- - ^ 

675 

, «i. Ti« riu Ara Tyr Phe Gin Ser Phe Pro hys Val 
Qlu Ala Gin Ala Phe He Glu Arg Tyr h-n 

690 



;aa Tn. xie oi» .y. «» f,^ «9 «v ^ 

705 ''^^ 

val Glu Thr Leu Phe Gly Arg Arg ^g Tyr Val Pro .eu Glu Ala 
725 

^ v.. ^ 0.U ^. oau «9 PH. «a net 

740 

P„ v,l 31. «y "a "5 - 

755 

... ... pro L.« 01- Olu net oly «a ».t l^u 31» val 

770 

..p «» val Olu »a - .va =lu ^ 

785 ''^^ 

Ma «s "a Lys olu vaX Mat 01. Oly Val Tyr Pr, Val 

805 

Olu val 01« val Oly XI. «y Olu MP Trp «« «. .ya 



820 »25 



Glu 



<210> 14 

<211> 833 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 



<400> 14 

«n ..r oly Mat Pro x.« P.. Olu Pro .y. Oly «3 val L.u 



Met o**-^ — -I 

1 ^ 



„1 «P oly Hi. Hta X.U .1. Tyr Thr Ph. HXa Ua L.« Ly. 

20 

Oly ... Thr Thr s.r Ar, Oly 01. Pro val 01. «a Val Tyr Oly Ph. 
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35 



40 



45 



Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 

165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 

275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 



Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
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340 



345 



.la Lys ASP Ser Val .eu Ala .eu Ar. Glu Gly Leu Oly Leu 
355 360 
pro pro Gly Asp Asp Pro Met Keu Leu Ala Tyr Leu Leu Asp Pro Ser 

370 ^'^^ 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 

390 j-'a 

111 Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
40S "^^^ 
Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 

420 

Olu val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 

435 "° 
Oly val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 

Ma Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 

Ann *k lo 

465 

His pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 

..p Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Glu Lys Thr Gly Lys 
500 

Arg ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 

515 

lie val Glu Lys He Leu Gin- Tyr Arg Glu Leu Thr Lys Leu Lys Ser 

530 535 
Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 

.eu His Thr Arg Phe Asn Glu Thr Ala Thr Ala Thr Gly Arg Leu Ser 

565 570 

ser ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 

580 585 
ox. II. «9 M. val 01. Al. Gly «a Val 
595 

^. «P T,, s« 01. n. =iu V.1 Ai. «i. 

610 

Gly ASP Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He His 
625 

. Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
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645 



650 



655 



Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 

770 775 780 

His Asp Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val 
785 790 795 800 

Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 



Gly 



<210> 15 

<211> 833 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 15 



Met Asn Ser Gly Met Leu Pro Leu 
1 5 

Leu Val Asp Gly His His Leu Ala 

20 

Gly Leu Thr Thr Ser Arg Gly Glu 
35 40 



Phe Glu Pro Lys Gly Arg Val Leu 
10 15 

Tyr Arg Thr Phe His Ala Leu Lys 

25 30 

Pro Val Gin Ala Val Tyr Gly Phe 
45 
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. Tv« Ala Leu Lys Qlu Asp Gly AsP A^-* Val He 
Ala Lys Ser Leu Leu Lys Ala Leu uys 

v,i p.. «P - "» ?f ^' 

II «a 0., 1 - - - - ^ "° 

^. I.. ox. V.1 ..P - «. - 

100 

..u VU «o o., Tvr 0.U Ma «P ..P «u Ma s« ..a «a ..a 

115 

oxu .,a =X» OXV 2^ 0- - xxe «a - - »P 

130 

^ ^„ ^ axa ^ S« «P ne HX. V.X Hi. Ola 

145 ^ 

ox, .yx na ™. «o M. T.P ... Trp OX. ^ oxv 5.. 

^ ^ 3Xa «P Ta «P ^ - - - - S: 

180 



. T o mv He Gly Glu Lys Thr Ala Arg Lys Leu 
Leu Pro Gly Val Lys Gly lie Giy y 

195 

, Glv ser Leu Glu Ala Leu Leu Lys Asn Leu ASP Arg 

Glu Glu Trp Giy bet 



I^U - 215 

210 



An= Tif. Ara Glu Lys He i^eu 
Leu Lys Pro Ala lie Arg uj.u j 



lie Leu Ala His Met Asp Asp Leu 



225 230 



.eu ser Trp Asp Leu Ala Lys Val Arg ^ Asp l.u Pro Leu Glu 
245 



vax «P P.a M. .ya ^ ^ OX. Pro «P OXu 

260 

..e «. ox. OXU P.a oxy .a. ... - - ^ ^ 

275 

OX. P.O .v. "a - - 1?. "° 

290 295 

oxy «a P>» V.X oxy P>» vaX ^ - "« ^ S 

310 

II ... Ma Ma Ma OXy OXy ^ V.X ,Xa «| Ma 

325 

ox. P.O ^ .y. "a - "5 °" "° 



Pro Glu pro xyj. ^:t^ ^ar 

• 340 
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Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 

580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 
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pro Leu Met Are Ala Ala Kys Xhr He Asn Phe aly Val Leu Tyr 

660 

«y -et S.r Hi. «9 Leu s« =1. 01" Leu »Ia U. Pro Tyr Olu 

675 

Olu Ala Oln Ala Phe He Olu Arg Tyr Phe Oln Ser Phe Pro Lys Val 
690 

Ala Trp He Qlu Lys Tbr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 

705 

val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 

v.a Ly. ser v.1 olu Al. ^ =1" ^ ^ "° 

740 

pro val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 

.eu Phe pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 

770 775 
His ABp Glu Leu val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 •''S 

Ma Arg Leu Ala Lys Olu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 

P.O Leu Glu val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 



820 ®25 



Glu 



<210> 16 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 



<400> 16 

Met Asn ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 



^0 
1 



.eu val ASP Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 

20 2^ 
Oly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 

35 40 
;aa Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 



50 
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Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 



Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 
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n D^o Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
pro pro Qly Asp Asp Pro Met Leu u / 

370 

™. «o 01. «y «i «y is; 

Z 01« «a «, HU Z - - fx? 

405 

Trp Oly «u Olu 01, olu OXu .eu «u Tyr 

420 

n T.n ser Ala val Leu Ala His Met Glu Ala Thr 
Glu val Glu Arg Pro Leu Ser Aia vax 

435 "° 



. ,.«n val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
Qly val Arg Arg Asp Val Aia ryr 

450 

01» Olu n. «3 L.U 01» Olu OXU val "9 .1. OIV 



465 "» 

,1. P., P.. «» - S >Ts 

«p Olu «u Leu pro oly .y. T>» 01. .y. «V 

500 

ser ..r Ser Ala Ala Val Leu Glu Ala Leu Ars Glu Ala His Pro 

ne val Z Lys Xle Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 

T wai Ilia Pro Arg Thr Gly Arg 
Thr Tyr Val Asp Pro Leu Pro Ser Leu Val Hxs Arg 

545 

Hi. «r Ph. 01. T.r «a Thr «a T., oly .a. Sar 

565 

■n- Val Ara Thr Pro Leu Qly 
ser ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg 

580 



Cl. Xla "a P.e na «a 01. 01. Oly - v.l 

595 

„a ... «P Ty. s.r oln Xle 01. .e. ^ val ^. "ia - - 

610 . 
Gxy .sp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He Hxs 
625 

^ .1. ^ «a s» Met P.e Oly v.l Pro ^ 01. .la val «P 

«0 Ar, 1. «a Ly. P"' -^^V -J ^ 

660 
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Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Qlu 
675 680 685 

Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 


17 


<211> 


839 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


17 


Met Asn Ser Gly Met 


1 


5 


Leu Val Asp Gly His 



10 15 



20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
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75 ^° 

65 

Tyr Lys .la Oly .r. Pro Pro olu .sp Phe Pro Oln 



.eu X.eu Xle .ys Olu X.u Val ^^P .eu .eu Oly .eu .la Arg .eu 

Glu Ala ASP ASP Val Leu Ala Ser Leu Ala Lys 
120 

rf»T avrr Tie LcU TlUT Ala Asp Lys 
Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu 

130 

His Val Leu His Pro Glu 
ASP Leu Tyr Gin Leu Leu Ser Asp Arg He His Vai 

145 

Oly xyr Leu Xle Thr Pro Ala Trp Leu Trp Olu Lys ^r Oly Leu Arg 



100 " = 

Glu val Pro Oly Tyr Glu Aia »»h ^^5 

115 "° 



165 "° 
ASP om Trp Ala Asp Tyr Arg Ala Leu T.r Oly Asp Olu Ser Asp 

180 

^ «y val Lys Oly n. Oly olu .y, 

195 

01„ .lu Oly S.r Le. Olu Ala Leu Leu Lys «n Leu -P 

210 215 
.eu Lys pro Ala Xle Arg Olu Lys Xle Leu Ala His Met Asp Asp Leu 

225 

Lys Leu ... Trp «P Leu «. Lys v.l «g Thr Asp Leu P~ Leu Olu 

' 245 
V.1 ASP Phe Al. Lys AT, «g Olu «o Asp Axg olu Ar9L.uA.5Al. 

260 

p.. Leu Olu ^ L.U Olu P». Oly S.r L,u Leu Hi. Olu Ph. Oly Leu 

275 

«u .lu ser pro Lys Al. L.u Olu olu Al. Pro Trp Pro Pro Pro Olu 

290 



Oly Ala Phe Val Oly Phe Val Leu Ser Arg Lys Olu Pro Met Trp Ala 

305 

.3P Leu Leu Ala Leu Ala Ala Ala Arg Oly Gly Arg Val His Arg Ala 

Ala Leu Arg Asp Leu Lys Glu Ala Arg Oly Leu 
345 

Leu Al, Lys ASP Leu ser v.l Leu Al. Leu AT, olu aly Leu Oly Leu 

355 

P.O pro Oly ASP ASP Pro Met Leu Leu Ala xyr Leu Leu Asp Pro Ser 



325 "° 

Pro Qlu Pro Tyr Lys Ala i.eu «rg — 

340 
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370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 

610 615 620 

Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg Asp lie His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
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675 



Olu Ala Gin Ala Phe He Glu Arg Tyr Plxe Qln Ser Phe Pro .ys Val 
690 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 "^^^ 

val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 

Arg val Lys ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 

pro val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 

755 "^^^ 
Leu Phe pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 

770 '^^^ 
His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 -^^^ 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 



805 810 



pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 



820 



825 



Glu His His His His His His 





835 


<210> 


18 


<211> 


839 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


18 



Met Asn ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
1 5 10 

Leu val ASP Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 

20 25 
Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 

35 *0 
Ala Lys ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 

50 5^ 
val val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 



65 



70 
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Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 

195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Alk 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 , 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 



Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 
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Thr Thr Pro Glu Oly Val .la ^ Oly Gly Glu Trp TJr 

385 

Olu Glu Ala Gly Glu Arg Ala Ala .eu Ser Glu Arg Leu Phe Ala Asn 

405 

L,. «9 o.y L,u L,. Tr. ^ T,^ 

420 

« T«,, qer Ala val Leu Ala His Met Glu Ala Thr 
Glu val Glu Arg Pro Leu Ser Aia vai li 



435 

. ^ 2van val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
Gly val Arg Arg Asp Val Aia xyr 

450 

al» Gl« lie «9 «9 ™^ fso 

r. pro P« - "» ^ f,S 

«p :^ A« «u pro ojy =1. Ly. »r oly .y. 

500 

»^ S.r S.r «. >a, V.1 ^» Olu «. Hi. P«. 

515 

Ti T»„ «iT, His Arg Glu Leu Thr Lys Leu Lys Asn 
He val Glu Lys lie Leu Gin His Arg v,x 

530 

Xyr val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 

.eu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 

S.. ser ^ pro - =^ "» f « Val «3 Thr Pro Gly 

580 

01» lU »r, Ala Phe lie Al. alu 01. Oly Trp ..u Val 

595 

A.P Tyr ser 01. lie 01. »r, v.l ^. Hi- 



615 

610 



Oly ASP Glu Asu Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 



^ Thr Al. =er Trp Met Ph. oly v.l Pro Ar= 01. Ala v.l Aap 

645 

n ^1. jvia LV8 Thr He Asn Phe Gly Val Leu Tyr 
pro Leu Met Arg Arg Ala Ala Lys Tnr ix 

660 

oxy „et ser Ala Bi. Ars ... «r 01. 01. .e. Al, XI. Pro Tyr 01. 

675 
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Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Net Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 19 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 19 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 
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«. -eu ne 01. V.1 Le» «« «, <-» 

Olu val pro Oly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 

115 

.ys Ala Olu Lys Qlu Oly Tyr Olu Val Arg He Leu Thr Ala Asp Lys 
130 

ASP Leu Tyr Oln Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 

Oly Tyr Leu He Thr Pro Ala Trp Leu Trp Olu Lys -Tyr Oly Leu Arg 

165 '^^^ 
pro ASP Oln Trp Ala Asp Tyr Arg Ala Leu Thr Oly Asp Olu Ser Asp 
180 

Asn Leu pro Oly Val Lys Oly He Oly olu Lys Thr Ala Arg Lys Leu 

195 200 
.eu Olu Olu Trp Oly Ser Leu Olu Ala Leu Leu Lys Asn Leu Asp Arg 

210 215 
.eu Lys pro Ala He Arg Olu Lys Xle Leu Ala His Met Asp Asp Leu 
225 

Lys Leu ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Olu 
245 

val ASP Phe Ala Lys Arg Arg Olu Pro Asp Arg Olu Arg Leu Arg Ala 

260 265 
Phe Leu Olu Arg Leu Olu Phe Oly Ser Leu Leu His Olu Phe Oly Leu 
275 

.eu Olu ser Pro Lys Ala Leu olu Olu Ala Pro Trp Pro Pro Pro Olu 

290 295 

, ^T^^ Tiaii q©r Ara IjVS Glu Pro Met 'Trp Ala 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg ^ys 

305 

Leu Leu Ala Leu Ala Ala Ala Arg Oly Oly Arg Val His Arg Ala 
325 



pro Olu pro Tyr Lys Ala Leu Arg Asp Leu Lys Olu Ala Arg Oly Leu 

340 -^^^ 
Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Olu Oly Leu Oly Leu 
355 

pro Pro Oly Asp Asp Pro Met Leu Leu Ala ^r Leu Leu Asp Pro Ser 

370 ^'^^ 
Asa Thr Thr Pro Olu Oly Val Ala Arg Arg Tyr Oly Oly Olu Trp Thr 



385 



390 
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Olu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg Asp lie His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 



Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 
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Arg Ala Trp He Olu Lys Thr Leu Glu Olu Gly Arg Arg Arg Gly Tyr 
705 710 715 

Pbe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 "0 



val Glu Thr leu Pbe Giy wrg iurg i^y -i- — — - 
725 "30 

Arg val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 

740 '^^^ 
pro val Glu Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 

755 "76° 
Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 

pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 

Glu His His His His His His 





835 


<210> 


20 


<211> 


839 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


20 



Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
1 5 10 

Leu val ASP Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 



20 

Gly Leu Thr Thr ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 

35 4° 
Ala Lys ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 



50 



55 



val val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
70 



65 



Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 



85 



90 



Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
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xoo 



105 



110 



Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 

115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 



Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
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405 



410 



415 



Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 

425 430 



420 



Glu 



Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 



435 



440 



445 



Gly Val Arg Arg Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 



485 



490 



495 



ASP Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 * 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 



Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 



580 



585 



590 



Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
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705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 • 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 ' 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 

835 



<210> 


21 


<211> 


839 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


21 


Met Asn Ser Gly Met 


1 


5 



10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu- Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 
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Olu Val pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

I.ys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 

ASP Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 

Pro ASP Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 
Glu Glu Trp Gly ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 



Leu 



210 215 

Ala lie Arg Glu Lys He Leu Ala His Met Asp Asp Leu 

225 



Leu Lys Pro Ala lie Arg uxu j^y^ xj.^ — ^ 

230 235 



Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
' 245 250 

val ASP Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 

265 



Phe Leu 

275 



260 

Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 



280 285 



Leu Glu ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 

290 295 
Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 



305 



310 



ASP Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 ^"^^ 

Ala ASP Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 

360 365 



355 



val pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 

370 3" 
Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 



385 



390 395 



Glu ASP Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 
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Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
•515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 

645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 



Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 
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val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 

Arg val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 



740 



Pro 



val Gin Oly Thr Ala Ala Asp I^u Met Lys Leu Ala Met Val Lys 

765 



755 



760 



Leu Phe pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 

780 



770 



775 



His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 22 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 22 

Met Asn ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
1 5 10 15 

Leu val ASP Oly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 



20 



25 



Oly Leu Thr Thr Ser Arg Oly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 

50 55 60 

val val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 "^^ 

Oly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 



85 



Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 



100 105 
val pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 



Glu - 125 
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Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 

325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 



Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 
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alu Val Glu Lys pro Leu Ser Arg Val Leu Ala His Met Glu Ala 1*r 



435 



440 



Oly val Axg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Leu 
450 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 475 

His pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 

ASP Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 

ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 

He val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 

530 

Xhr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 

Hi. T>„ «9 Ph. 01. Thr Ala Thr oly s.r 

565 570 

ser ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 

580 585 
Oln Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 

Gly ASP Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 

pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 

ox, S.r Ma Hi. L.„ ser 01. Glu L.U «. n. Pre Tyr oiu 

675 

Olu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 



705 



710 



val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 



725 "^30 
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Arg Val hys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn^Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp heu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 


23 


<211> 


839 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


23 


Met Asn Ser Gly Met 


1 


5 



10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
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140 

135 

130 

T.^« Ti*» His Val I»eu His Pro Glu 
ASP Leu Tyr Gin Leu Leu Ser Asp Arg He His 

145 " 

^ ne P~ M. - ^ 

^ „p oin TTP 11 T,' fn 



on niv Glu LYS Thr Ala Arg Lys Leu 

Asn Leu pro Gly Val Lys Gly lie Gly Glu Ly 

195 

^ Txp oiy s» ..u ox« «. ^ - ^v; «. - "P ^ 

210 

^„ Xle =1« .y. - 

225 

.,3 ^ s» «P val »P - S 

^ 245 

„P P>» M. .ys «9 - 'Si 



... Olu Oly - .,u «u Hi, Olu .IV - 

3x„ r. «o .ys - 0- - "° IS? "° "° 

290 295 
Oly Ala Phe Val Oly Phe Val Leu Ser Arg Lys Olu Pro Met Trp Ala 
305 

^ ^ «. «. ™' - 

„p pro Z =ly ..u .y. .=P 



340 

... .ys «P ... V.1 - - fl 

VaX pro Ty «P MP P" «.« «» ^' '~ 

370 

T>u pro Oly val «a Ar, Tyr «y Oly =1. "P «r 

Til ^ «. «. - !ts "I "° 

405 



^. 1.U Ly. 01. Oly Olu OIU .y, 1*. Trp J- T^ 

420 

« T.,i ser Arg Val Leu Ala His Met Glu Ala Thr 
Glu Val Glu Lys Pro Leu Ser Arg vax 
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435 



440 



445 



Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Arg Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 

515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr lie Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 



Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
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740 ^" 



..o va. 01. -a Met .ys .eu ^^a Met VaX .ys 

755 

PH. ^ - al„ »e. 0.y «. «S - - - 

770 

,U 01» Le. va. Ma »' S 

I »" S "° "° - 

805 

alu val 31. val Oly na .Iv 31u ».P ..P - |« 



820 



Glu His His His His His His 
835 

<210> 24 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 



<4UU> 

Met sar 3Xy Mat La» Pro «. »e OXu .va Oly VU .au 



<400> 24 

Met Leu Pro hen Phe 

L ,.1 «p L "i= - 

20 

,1, s« 31y 3JU pro val 31. «a val 

35 

T Ten Lvs Ala Leu Lys Glu Asp Gly Asp Ala Val He 
Ala I»ys Ser Leu Leu Lys Axa ucu / 

50 

val val P.a «P ^ - 

1 Tyr .y. Ala 31y Z - ^ «P - 31. 

^ «a .a. XI, 1^ 31. ... val «P «. ^ Oly - 

100 

val pro 31y Tyr 31. ».P .-P v.l ... »!> - '''^ 

115 

. T riu Glv Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
Lys Ala Glu Lys Glu Gly Tyr v:.xu 

130 
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Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 



Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260| 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 



Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 



Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
365 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 
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r ;^x, val Ala Tyr Leu Arg Ala Leu Ser I^u Glu Val 450 
Gly Val Arg Leu Asp vai Aia iyi- 
455 "° 

„. oiu oiu n. - in ^"^ 

465 

His pro Phe Asn Leu Asn Ser Arg Asp .In Leu Olu Arg Val Leu 

.3P Olu Leu Oly Z pro Ala Xle Oly Lys ..r Olu Lys X.r Oly Lys 

V. o Ala val Leu Olu Ala Leu Arg Olu Ala His Pro 

Arg Ser Thr Ser Ala Ala vai ^eu 

, T rin Tvr Arq Glu Leu Thr Lys Leu Lys Ser 
lie val Olu Lys He Leu Gin Tyr Arg giu 

530 

Tip. His Pro Arg Tbr Gly Arg 
Thr Tyr He Asp Pro Leu Pro Asp Leu He His 
545 550 



.eu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Oly Arg Leu Ser 
565 

-n« 0^1^ Val Ara T!hr Pro Leu Gly 
ser ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg 

580 



,vi trt,^ Tie Ala Glu Glu Oly Trp Leu Leu Val 
Gin Arg He Arg Arg Ala Phe He Ala Qiu 

595 

Ti» riu Leu Arg Val Leu Ala His Leu Ser 
Ala Leu Asp Tyr Ser Gin He Glu Leu Arg v 

610 

«P Olu L=» n. V.1 P.. ol„ Olu aly «P n- «!. 

625 

^ Olu m ;a. s« «P "et V.J - IS 

645 

..u M« «9 «a «a T.r XI. P^^ oly v.l .eu Tyr 

660 

oly net Ma Hi. - s.. 01» Olu .eu «a 11. P~ Ty. olu 

675 

Olu «a 01. .la P^ lla olu ^ foo 
690 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Oly Tyr 



val Olu Thr Leu Phe Oly Arg Arg Arg Tyr Val Pro Asp Leu Olu Ala 
725 '-^^ 
val Lys ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 



740 
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Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lye Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 V 830 

Glu His His His His His His 





835 


<210> 
<211> 
<212> 
<213> 


25 

839 

PRT 

Artificial 


<220> 
<223> 


Synthetic 


<400> 


25 


Met Asn Ser Gly Met 
1 5 


Leu Val Asp Gly His 
20 



10 15 



25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val- Leu His Pro Glu 
145 150 155 160 
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o., TV. n. «r «. «P - - — - 

^ ^ 01. T« 2 «P TV. f4 - - S 
180 



oi Ti« Qlv Qlu Lvs Thr Ala Arg Lys Leu 
ASH Leu pro Gly Val Lys Gly lie Gly y 
195 

^1 Ma T.*»u Leu Lys Asn Leu Asp Arg 
I,eu Glu Glu xrp Gly Ser Leu Glu Ala Leu Ly^ 

210 

- U. »5 .V. «. »i. - - - 

225 

s„ «P - val «s - - - 

245 

«P ..e «3 0>u «o «P - - 

ox. X.. ox. ^ OXJ se. .e. - - »^ 

275 

ox. ^ - "° 

290 ^- 
OX, ^ PKe VaX OXv ..e V.X ... S„ - - "« 

305 

^ ... ». - - 

P„ ... Z OX, ^1 - ^v. - V.X «5 3XV ... 

340 

7.^^ Qpr Arq Glu Gly Leu Asp Leu 
Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg ^^y 

355 

,.X P„ OX, MP ..P - - - - «a TV. «. - - 

370 ^ 
T.. OX. OXV V.X -V 0- - 

385 

o^y^ rjiu Arq Leu His Arg Asn 
Glu ASP Ala Ala His Arg Ala Leu Leu Ser Glu Arg 
405 

^. ... .v» - ox. OXV OX. ox. .V. - ... «P S 

420 

ox. V.X ox. .v= pro ... s.r ^ V.X - ^x. «. 

435 

val Ala Tvr Leu Arg Ala Leu ser Leu Glu Val 
Gly val Arg Leu Asp Val Ala Tyr u a 
450 
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Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala lie Gly Lys Thr Glu Lys Thr Gly Lys 

500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 

610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 * 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 

725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 



Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 



74 



wo 01/90337 



PCT/USOl/17086 



, «-i M^*- rw Ala Arq Met Leu Leu Gin Val 
Leu Phe pro Arg Leu Glu Glu Met Gly Ala Arg 

770 '^'^^ 

,is oiu L.. V.1 ^ =iu pro .y. «« 

785 "» 

^» .la Ly= clu V.X M.t =1. OlJ v.1 T,r Pro «u Jl. V.l 

805 

^ Olu val Olu V.1 Oly ne 01, 01« ..P ^=:p - Ala Ly. 

820 

Glu His His His His His His 
835 

<210> 26 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 



Met Asn Ser Oly Met .eu Pro .eu Phe Olu Pro .ys oly -1 .eu 
1 5 

. T Ma Tvr Ara Thr Phe His Ala Leu Lys 
Leu val Asp Gly His His Leu Ala Tyr Arg Tnr f 



20 25 



Oly Leu .hr Thr Ser Arg Oly Olu Pro Val Oln Ala Val Tyr Oly Phe 

Ma Lys 1 Leu Leu Lye Ala Leu Lys Olu Asp Oly Asp Ala Val He 

50 

val val Ph. ^ Al. ^y= =~ "° 

Tl, Ty, .y= .1. Oly -° "° 

^. «. L.U n. lys 01. v.1 ASP =ly 

100 

01. val P.O Oly Ty, Olu Ala A.p Asp val .au AI. S„ Leu Ala Lys 

115 

Ala Olu .y. 31. Oly Ty, Olu val A., He La. Al. Asp .ys 

130 

Leu Tyr Oln Leu Leu Ser Asp Arg lie His Val Leu His Pro Olu 



Gly 



Leu lie ,^r Pro Ala Trp Leu Trp Glu Lys Tyr Oly Leu Arg 
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X65 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 

245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala* Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
365 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
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475 

465 

Ar-rt Tiso Gin Leu Glu Arg Val Leu Pne 
His Pro Phe Asn Leu A3n Ser Arg Asp Gin Leu 
485 

111. He Gly Lvs Thr Gin Lys Thr Gly Lys 
ASP Glu Leu Gly Leu Pro Ala lie Giy i^Vb 

500 ^""^ 

. ..la Ala val Leu Glu Ala Leu Arg Glu Ala His Pro 
Arg Ser Thr Ser Ala Ala vaj. u 
515 

XI. ai„ .y. ^ Tyr ^ oxu Leu ^ .y. =~ 

530 "5 

1=.. Tip His pro Arg Thr Gly Arg 
Thr Tyr He Asp Pro Leu Pro Asp Leu He Hx3 Pro g 

545 "° 

»i= ThX .H, «» 01. T« ^ «V 

565 . 

T-i Val Ara thr Pro Leu Gly 

ser ser ABp Pro Asn Leu Gin Asn lie Pro Val Arg in 

580 

01. ^...^^ "a «e «. oxu «u .xy Trp L=. val 

595 

Ma Leu Asp Tyr Ser Oln Xle Olu Leu Arg Val Leu Ala His Leu Ser 



610 "5 



Gly ASP Glu Asn Leu He Arg vax 
625 



val Phe Gin Glu Gly Arg Asp He His 



^ Olu Thr Ala ser Trp Met P.e Gly Val Pro Arg Glu Ala Val Asp 
645 

p„ ... ».t Al. II. val leu TV. 

660 

Olv -et S.r «a "i= -5 "° 

675 

alu 01. Al. U. «9 Tyr P.. .1. -J -V «1 

690 

Ala Trp II. 01. Lv. T-r L.U Ola 01. oly Ar. oly Tyr 

705 ^'^^ 



val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 

^ val Ly. ser Val ^ 01. .la 01. -J 

740 

V.1 Oln oly Thr .la «. «P - - ^ ™' 

755 

:^ pro I.. 01. 01. Oly «. 
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770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Ly3 Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 





835 


<210> 
<211> 
<212> 
<213> 


27 

839 

PRT 

Artificial 


<220> 
<223> 


Synthetic 


<400> 


27 


Met Asn Ser Gly Met 
1 5 


Leu Val Asp Gly His 
20 



10 15 



25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 
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^ ^ «P "J r,; 

180 

«i Tio riv (31u LYS Thr Ala Arg Lys Leu 
Asn Leu pro Gly Val Lys Gly lie Gly Glu Lys 

195 

aiu 0X« T.P OIV Se. ^. 01. L.. Lys «. Le. «P 

210 21b 

.ro KU. n. Arg 01. Ly. II. L.. .1. KU ».t «P A=P 

225 

.V. L.. s,. Trp A.P «a V.1 A., T« «p pro ^. 01. 

245 

Val A.P P^ Al. AT, Ar, 01. pro A=p Ar, 01. "^9 -J "9 Al. 

260 

p., 1.. 01. Arg ^ 01. Ph. Oly «r L.. Ki. 01. P.. oly «. 

275 280 
.1. ser pro ^. Al. ^. 01. 01. Al. Pro.Trp Pro Pro Pro 01. 

290 

^ 1 T ci*.r Ara Lvs Glu Pro Met Trp Ala 
Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys ^.x 

305 

.sp Leu .eu Ala Leu Ala Ala Ala Arg Oly Oly Arg Val His Arg Ala 
325 

.1. ^ pro Leu Ala Oly Leu Lys Asp Leu Lys Olu Val Arg Oly .eu 

Leu Ala Lys Z Ala Val Leu Ala Ser Arg Olu Oly .eu Asp Leu 

val pro liy Asp Asp Pro Met Leu Leu Ala ^r Leu Leu Asp Pro Ser 

370 375 
Asn ^ «.r pro Glu oly val Ala Arg Arg Tyr Oly Oly Olu Trp Thr 
385 390 

Olu ASP Ala Ala His Arg Ala Leu Leu Ser Olu Arg Leu His Arg Asn 

x,eu Leu Lys Arg Leu Olu Oly Olu Olu Lys Leu Leu Trp Leu Tyr His 
420 

o T«. ser Arq Val Leu Ala His Met Glu Ala Thr 
Olu val Olu Lys Pro Leu Ser arg vaj. ucu 

435 "° 
Oly V.1 Arg Le. A.p val AU Tyr Arg Al, M. u.. 01. v.l 

450 

Ma Olu Olu lie Ala Arg Leu olu Ala Glu Val Phe Arg Leu Ala Oly 



465 



470 
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His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala TOLa Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arq Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 

755 760 765 



Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 
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His Asn Olu Leu Val Leu Glu Ala Pro Lys Olu Arg Ala Glu Ala Val 
785 ■'90 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 

805 • 
P.O Leu Olu val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys _ 



820 



Glu His His His His His His 835 . 



<210> 28 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 



<400> 28 

„« «. ST Oly Me. «> .h. OIu .ro Ly. Oly ^ V.l Leu 



10 

1 = 



V.1 ».p alv His Hi. Le„ «a Ty. Ph= Hi. «a Le» Lys 

20 25 
Oly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 

Ma LVB ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 

, . Ai» D^o c?pr Phe Arg His Glu Ala Tyr Gly 
val Val Phe Asp Ala Lys Ala Pro Ser pne Arg 

65 7° ' 

Oly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 

85 ^ 
X,eu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 

Olu val pro Gly O^r Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 

115 ^20 
.ys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 

ASP Tel Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 

Oly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 

P,o ASP Oi„ AU ASP Tyr A« L,„ T.- Oiy Asp 01» S« A.p 

180 
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Asn Leu Pro Gly Val Lya Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 . 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 

245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lya Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 

420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 



Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 

465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 
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oiu «u oiy .r, «. n. fv Ly» t,» ^ 

500 

»r, s.r T*. ser Al. Al. v,l Le„ alu Leu «9 01„ «a HI. Pro 

515 

lie Val Glu Lys He Leu Olu Tyr Arg Glu Leu Thr Lys Leu Lys Asn 
530 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 

565 570 

ser ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 

580 585 
Oln Arg He Arg Arg Ala Phe Xle Ala Glu Glu Gly Trp Leu Leu Val 



595 «0«> 



«i« Ti» fslu Leu Arq Val Leu Ala His Leu Ser 
Ala Leu ASP Tyr Ser Gin lie Glu Leu Arg va 

610 

Gly ASP Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 



Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 

645 ^50 
pro Leu Met Arg Arg Ala Ala Lys thr He Asn Phe Gly Val Leu Tyr 

660 

al, «et s« M. HI. ^ L,u s« 01« Olu L=u M. lie Ko «„ 

675 

Olu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 

690 ^^5 
Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 

val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 

^g val Lys ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 

pro val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 

755 '^^^ 
Leu Phe pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 

770 ''''5 
His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 



785 790 795 
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Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 


29 


<211> 


839 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


29 


Met Asn Ser Gly Met 


1 


5 



10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 
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Olu 01» Trp 01, s« Leu cl« «» - "P 

210 2" 

... .y. .ro Ma XI. «9 =1. ^.V 

225 

Ly. ..u S.r Trp »»| Leu «. Ly. Val Vhr «p L.U Pro Leu olu 
v,l «P Phe AU Lys «9 01. P" Asp Axg olu Leu Arg Al. 

260 -^^^^ 

P.e Leu Olu A.g Leu olu P.. oly ^» Leu Leu Hi. olu p., Cly Leu 

275 

Leu Olu ser Pro Lye Ala Leu olu 01. Al, Pro Tjp Pro Pro Pro olu 

290 

, «i« TT 1 Ton ser Arq Lys Glu Pro Met Trp Ala 
Gly Ala Phe Val Gly Phe Val Leu Ser Arg uy^ ^^o 

305 

«p ..u Leu Al. Leu Al. Al. Al. AT, Oly Oly AT, V.l Hi. Ar| Al. 
„. A.P pro La. Al. Oly Leu Ly. Ajp Leu Ly. Olu Val Ar, oly Leu 

340 

«u Al. Ly. ASP L.U Al. V.l Le. Ala Ser A„ Olu oly Leu A.P Leu 

355 



val Pro Oly Asp A.p Pro Hat Le. Leu Al. Tyr Leu Leu Aap Pro .ar 

370 

Asn Thr Thr Pro Olu Gly Val Ala Arg Arg Tyr Oly Gly Glu Trp T.r 
390 

Z A.P Ala Al. Hi. AT, Al. Le. Leu sar olu Ar, Leu Hi. Ar| A,. 

405 

«u L.. Ly. Ar, Leu olu 01, olu Olu Ly. Leu Leu Trp L.u ^ Hi. 

420 

Olu val Olu Ly. Pro Le. oer Ar, v.l Leu Ala Hi. Het 01. Ala 

435 

Oly val AT, L,u Aap Val Al. ^r Leu Ar. Al. Leu ser Leu olu V.l 
450 

«■! rii,i Val Phe Arg Leu Ala Gly 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Ptie Arg 

465 ^-"^ 

His pro Phe A.n Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 



485 



«p .lu Leu Oly Leu Pro Al. lie Oly Ly. T.r 01. Ly. ^; oly Ly. 

500 
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Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Glu Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 

595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg Asp lie His 
625 630 635 640 

Thr Glu Thr Ala Ser Tirp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 • 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Qlu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 



Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 
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^ OXU V,. 01. va. OX, OXV 0X» MP Trp - 

820 ^ 

Glu His His His His His His 
835 

<210> 30 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 30 

ox, ^ - - - - - - I" 

L V.X «P ox, L „i= «. ^ - - - - 

20 

«V OX- ..o V,X ox. «. V.X ^ OX, PH. 

„. 1 «u - 

50 

.1 LVB Ala pro Ser Phe Arg His Glu Ala Tyr Gly 
Val Val Phe Asp Ala Lys Ala fro 80 

65 ''^ 

ox, ^ «. «v «s - - ^ Pjo OX. ... - OX. 
»x. X.. XX, II. ox. ^ ».x » - - - 

100 

^, asD ASP Val Leu Ala Ser Leu Ala Lys 

Glu val Pro Gly Tyr Glu Ala Asp Asp va 
115 ^ 

ox. ox. ox, ox. V,X ^ XX, ... - - »' 
I ^ ox. sc. - - - "° 

145 

Thr pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arc, 
Gly Tyr Leu He Thr Pro AJ-a ■^'-f 175 
X65 

^ ox. -a. - - - - l^, 

180 

.1 Glv Glu LYS Thr Ala Arg Lys Leu 

Asn Leu Pro Gly Val Lys Gly lie Gly Y 
195 



Leu 
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210 



215 



220 



Leu hya Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 



Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
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rin Tvr Arg Glu Leu Thr Lye Leu Lys Set 
lie Val Glu Lys He Leu Gin Tyr Arg 

. v«i His pro Arg Tbr Gly Arg 
™. Tyx II. «P - "° ™' S's ' 

565 

3.. «P p» - - - - lit ^ va - - 

, ov,. lie Ala Glu Glu Gly Trp Leu Leu Val 
om Arg He Arg Arg Ala Phe lie 
595 

c rln lie Glu Leu Arg Val Leu Ala His Leu Ser 
Ala Leu ASP Tyr Ser Gin lie Glu u 

Z - - - - - IS - 

645 

660 

- - - - - SI 

I - - - - - - 

690 

^ ^ n. o.« - - - - — = ^0 

„^ pro ASP lieu Glu Ala 

val Glu T.r Leu Phe Gly Arg Arg Arg Tyr Val Pro 
725 



Arg val Lys ser vai ™ 
740 



n Glu Arq Met Ala Phe Asn Met 

ser val Arg Glu Ala Ala Glu Arg w 

v.. - - - - "° f« 

755 

„o ^ 0- - - - Ti "° 

Z ... ^ - ^v. - - - r. 

. XT 1 rrHrr- Pro Ijeu Ala Val 
. n T.U Ala LYS Glu Val Met Glu Gly Val Tyr Pro I. 
Ala Arg Leu Aia uys ^ 
805 

o.. ox« v.. oxy »,P «P - - - 
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820 825 830 

Glu His His His His His His 
835 



<210> 31 
<211> 839 
<212> PRT 
<213> Artificial 






<220> 

< 2 2 3 > Synthetic 






<400> 31 










Met Ash 
1 


Ser 


Gly 


Met Leu Pro Leu Phe Glu Pro Lys 
5 10 


Gly Arg Val 
15 


Leu 


jjeu va± 


Asp 


Gly 
20 


His His Leu Ala Tyr Arg Thr Phe 
25 


His Ala Leu 
30 


Lys 


Qly Leu 


xjir 
35 


Thr 


Ser Arg Gly Glu Pro Val Gin Ala 
40 


Val Tyr Gly 
45 


Phe 


Ala Lys 
50 


Ser 


Leu 


Leu Lys Ala Leu Lys Glu Asp Gly 
55 60 


Asp Ala Val 


He 


Val Val 
65 


Pne 


Asp 


Ala Lys Ala Pro Ser Phe Arg His 
70 75 


Glu Ala Tyr 


Gly 
80 


Gly Tyr 


Lys 


Ala 


Gly Arg Ala Pro Thr Pro Glu Asp 
85 90 


Phe Pro Arg 
95 


Gin 


Leu Ala 


Leu 


lie 
100 


Lys Glu Leu Val Asp Leu Leu Gly 
105 


Leu Ala Arg 
110 


Leu 


Glu Val 


Pro 
115 


Gly 


Tyr Glu Ala Asp Asp Val Leu Ala 
120 


Ser Leu Ala 
125 


Lys 


Lys Ala 
130 


Glu 


Lys 


Glu Gly Tyr Glu Val Arg lie Leu 
135 140 


Thr Ala Asp 


Lys 


Asp Leu 
145 


Tyr 


Gin 


Leu Leu Ser Asp Arg He His Val 

150 155 


Leu His Pro 


Glu 
160 


Gly Tyr 


Leu 


lie 


Thr Pro Ala Trp Leu Trp Glu Lys 
165 170 


Tyr Gly Leu 
175 


Arg 


Pro Asp 


Gin 


Trp 
180 


Ala Asp Tyr Arg Ala Leu Thr Gly 
185 


Asp Glu Ser 
190 


Asp 


Asn Leu 


Pro 
195 


Gly 


Val Lys Gly lie Gly Glu Lys Thr 
200 


Ala Arg Lys 
205 


Leu 


Leu Glu 
210 


Glu 


Trp 


Gly Ser Leu Glu Ala Leu Leu Lys 
215 220 


Asn Leu Asp 


Arg 
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.eu .ys pro Ala He Arg Olu Lya He Leu Ala His Met Asp Asp Leu 
225 230 

.ys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 

245 

Val ASP Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 



260 



P.e Leu Olu Arg Leu Olu Phe Oly Ser Leu Leu His Olu Phe Gly Leu 



275 280 



.eu Olu ser Pro Lys Ala Leu Olu Olu Ala Pro Trp Pro Pro Pro Olu 

290 

Oly Ala Phe Val Oly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 3^*^ 

I.U Leu Ala Leu Ala Ala Ala Arg oly Oly Arg Val His Arg Ala 



305 

Leu Leu Ala 

325 

pro Glu pro Tyr Lys Ala Leu Arg Asp Leu Lys Olu Ala Arg Oly Leu 

340 



^ Ly. «p ser v.l Leu Ma «9 

355 

pro pro Oly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 ^'^^ 

Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Oly Oly Olu Trp Thr 

385 

Olu Glu Ala Gly Olu Arg Ala Ala Leu Ser Olu Arg Leu Phe Ala Asn 
405 

.eu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 

Glu val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 

Oly val Arg Leu Asp Val Ala l^r Leu Arg Ala Leu Ser Leu Glu Val 
450 

Ma Olu Olu tie Ala Arg Leu olu Ala olu Val Phe Arg Leu Ala Oly 
465 470 

His pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Olu Arg Val Leu Phe 

485 

^vsp Glu Leu Gly Leu Pro Ala lie oly Lys Thr Oln Lys Thr Oly Lys 
500 

.rg ser Thr Ser Ala Ala Val Leu Olu Ala Leu Arg Olu Ala His Pro 



515 520 
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lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg Asp lie His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 

755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 



Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 
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Glu His His His His His His 





835 


<210> 


32 


<211> 


839 


<212> 


PRT 


.<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


32 


Met Asn Ser Gly ^ 



Leu pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
L ,.x L Hi= ^» «a ^ ^ 

20 

^ Thr ^ s,, =iy 01" - -X 01, ^. -1 ^ oiy p« 

„a .ye 1 -V 7 

,.1 1 p,» «p 5J= - - ^ ^ - 

li, ^ .y» fv - 11" ^ 

^„ «u XI. Z 01. .«u val «P - "° 

100 

Olu V.1 P» 01, Ty. Olu «P V.1 ^ - "* 

115 ^"^^ 

.y, «a 01. ^ 01. Oly Tyj 01. val He - ^ 

130 

Tv^rt Ti^^ His Val Leu His Pro Glu 
ASP Leu Tyr Gin Leu Leu Ser Asp Arg He His Val 

145 

.1, ^ x». Xl= pro ^. ^ ^ 

165 -^'^ 

P„ «P 01. «p M. «P Tyr «a »a ^ ^ 

180 

pro OIV val .ys Oly Xlo Oly 01. ^ - «. I-V' - 

195 

01. 01. T.P oly sor ^. oi. ... - ^v; x.. «P 

210 215 

X.. .ys pro lie 01. .ys XL «. Hi. Hat «P ».P -J 

225 230 
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Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 

260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 



Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 



Leu Leu Gly Arg Leu Glu Gly Glu 
420 

Glu Val Glu Arg Pro Leu Ser Ala 
435 440 



Glu Arg Leu Leu Trp Leu Tyr Arg 
425 430 

Val Leu Ala His Met Glu Ala Thr 
445 



Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 

485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 
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Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr aly Arg 
545 550 555 

Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 



Leu His Thr Arg Phe Asn uxn — 
565 570 

ser ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 

580 585 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 

Ala Leu ASP Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 

610 "0 
Gly ASP Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 



625 



Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 



645 



650 



pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 

660 665 
Qly Met ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 

675 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 

690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 

val Glu Thr Leu Pha Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 

Arg val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 

Pro val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 

760 765 



755 



Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 



770 



775 



His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 



785 



790 



795 



Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 



805 810 



pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 



820 

Glu His His His His His His 
835 



825 
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<210> 


33 


<211> 


839 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


33 


Met Asn Ser 61y Met 


1 


5 


Leu Val Asp Gly His 




20 



10 15 



25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
65 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 

115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 
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,al «p Pbe Al. 01» pro «P GXu «g «. Ma 

260 265 

..u Olu .^u Olu P^e Oly S.r L« .i= olu "» 

275 280 



I.U Glu Ser pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Olu 

290 295 

, «i r^u^ ^r^^ T.f^u Ser Arq Lys Glu Pro Met Trp Ala 
Gly Ala Phe Val Gly Phe Val Leu ser Arg ^y^ 

305 

X,eu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 

pro Glu pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 

Ma .y. «P S.r val Ma I-" "3 "| 



355 



360 



pro pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 

370 ^'^^ 

Thr Thr Pro Glu Gly val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 ^-^^ 
Trp Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 ^25 
Glu val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Olu Ala Thr 



435 



440 



Gly val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Olu Val 

450 455 
;aa Glu Glu lie Ala Arg Leu Olu Ala Glu Val Phe Arg Leu Ala Gly 
465 *70 475 

His pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 

Glu Leu Oly Leu Pro Ala lie Gly Lys Thr Oln Lys Thr Oly Lys 

500 

ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 
lie val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 

530 535 
Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 



545 



550 
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Leu His 



Thr Arg Phe Asn 
565 



Gin Thr Ala Thr 
570 



Ala 



Thr 



Gly 



Arg 



Leu 
575 



Ser 



Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 

645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 ' 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 34 
<211> 842 
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<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 



<400> 34 

mi, Tila Met Leu Pro Leu 

10 

Gly His His Leu Ala Tyr Arg Thr PHe Phe Ala Leu 



«t s.r Olu Ma K« ..u «<. - - 



20 " 



Oly Th. «» s.r «g Oly OXu Pro .a. Val Ty. oly 

P.e «a ly. sar ^ 
50 " 

V.1 val val P^e ».p ^y= »• «^<> ^' «» «» fo' 
65 " 

^ 01„ M. ryr Lys «. Oly «g «a Pro Tta Pro Olu «P P^a «■> 

^ 01= «a 1 XXe .y. 0I» La. VaX Asp Gl, P.a T^r 

100 

Olu val Pro OlV Tyr «u «. «P "P val T« 

115 

«a wa Ly. OXu Ly. olu oXy Tyr OXu v.x Ar, xXe X.U Thr «a 

130 ^-^^ 
^sp Arg ASP Leu ^ Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 



Pro Olu OXy Hi, Leu Xla B,r Pro OXU Trp Lau Trp OXu Lys Tyr OXy 

165 

.au pro OXU OX. Trp VaX ^ PJa «9 »Xa Lau vaX OXy AsP Pro 

180 

sar MP AS. Leu pro OXy V.X Ly. OXy XXa OXy OXu Lya Tto AX, ^ 

195 

Lau Lau Lya OXU Trp OXy s.r Lau OXu Lau Lau Lya Aa. Lau 

210 

ASP Arg val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 

OXU A.P Leu Ar, Lau ser Lau OXu Lau Sar Ar, val Ar, Aap Lau 

245 



P.O Leu Glu val Asp Leu Ala Gin Oly Arg Glu Pro Asp Arg Glu Gly 
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260 



265 



270 



Leu Arg Ala Phe Leu Giu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Asp Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 

435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 



Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
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us 

^ ^„ ser ser ser «P ». i^a He ^ro V.J 

580 

Oly .1. ^ M. ... V.1 aj« «v 

595 

»U Leu val .e„ «P ^ Se. 01„ lie Olu '"^ 

610 



„. se, 0» .=P Ola «a Leu n. «9 v.l 01« OXu OXy Ly. 

625 

„p Xle Thr 01. 0.r «a Sex Tn. »et ^^e oly vel Pro «o ol» 

645 



.1. v,l «p pro «et «9 f,? ^"^ 

660 



„,1 L=u Ty. oly «et S.r Al. Hie Leu 01» Gl» Leu Ala XI. 

675 

P„ ^ alu 01. «. val M. X- Olu «, Tyj P.e 01. sex P.= 
690 

P„ Ly. val «, Ala Trp 11. olu Lya ^ Leu Olu Olu Oly A« Ly. 

705 "^^^ 

^ «y .y. val Olu Leu P.. .ly A« Ax. A.3 Ty, Val Pxo Aep 
725 

„ 1 T„= qer Val Arq Qlu Ala Ala Glu Arg Met Ala 
Leu Asn Ala Arg Val Lys Ser Val Arg 
740 

P.e AS. net pre val 01. oly Ala Ala Asp Leu Met Ly. Leu Ala 

755 

val Lys Le. P.e Pro Ar, Ku Ax, olu Met Oly Ala Ax, M.t :.u 

770 

.eu om val His Asn Glu .eu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 ■'9° ^ 

Olu Glu val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 ^-^^ 

^1 all, Val Glv Met Gly Glu Asp Trp Leu 

Leu Ala val Pro Leu Glu Val Glu Val Qiy w 



820 825 



ser Ala Lys Gly His His His His His His 
835 



<210> 35 

<211> 842 

<212> PRT 

<213> Artificial 
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<220> 

<223> Synthetic 
<400> 35 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 

180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 
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„u ^ M. Ph. L.U Olu Le. Ph. =1V - - 

275 

^1 M;, Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
Phe Gly Leu Leu Glu Ala Pro Axa fro 

290 

„0 01« Oly «. P«. val Ph. ..1 L.« Pr. 01» P.J 

305 

Trp «a al. .ys «S »^ «| v-X 

325 "^"^ 

Hi. ^ «. ^I. "P pro ..u 01, >.u .y. .y. ox« V.1 

^ 01, Leu HI .y. ASP "J -1 !.« 

355 

^ «p «. V.1 pro Oly «P «P P» ».t «u L,u Tyr L.u 

370 



.sp pro Ser Asn «.r Thr Pro Glu Gly Val Ala Arg Arg .^r Oly Oly 
3B5 

, n ,11= ivia His Arg Ala Leu Leu Ser Glu Arg Leu 
Glu Trp Thr Glu Asp Ala Ala His Arg h ^^^^ 

405 



His Arg Asn Leu Leu Lys Arg Leu Giu ^xy «.xu 



Glu Gly Glu Glu Lys Leu Leu Trp 

420 

.eu Tyr His Glu Val Glu Lys Pro Leu ser Arg Val Leu Ala His Met 
435 440 

Tier, Val Ala Tvr Leu Gin Ala Leu Ser 
Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr 

450 



^ 01. L.. Al. Olu Olu n. «9 «9 L.U Olu OIU Olu V.1 Ph. AT, 

465 

«. aly His pro Ph. «a «. s» »P S . 

485 

val Ph. A.P Clu L.U L.. pro «. L.U Oly Ly. Thr Ola Ly. 
500 

. „ M= nifl Val Leu Glu Ala Leu Arg Glu 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu 

515 

UU pro 11. V.1 01. .y. lie «u 01„ HI. Olu ... Thr .y. 

530 

„u -y. «. Thr Tyr val «P Pro ... Pro s.r ..u Val HI. Pro 

545 

Thr Oly »r9 ^ HI. Thr «3 Ph. "„ 01; Thr «a Thr «a Thr Oly 

565 
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Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Ala Met Ala 

740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val His Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 

<210> 36 

<211> 842 

<212> PRT 

<213> Artificial 

<220> 
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<223> Synthetic 
<400> 36 

«n 1.1. MPt Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
Met Asn Ser Glu Ala Met Leu i-ro licu 

1 5 



X.eu X.eu val ^sp Gly His His .eu ^la Tyr Ars Thr Phe Phe .la .eu 

20 

01, ^^U Th. S« =1, "O f,' ^ 

P.. S.. L,« L,« .y. -V. 01" «V ^ -V. "a 

50 

val Phe val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 

70 

Z Olu »!. Ty. x.y» "a -9 «o «u: pro olu «P Ph. P~ 

85 

^ 01. I*. 01« val «P .e« Oly Ph. Thr 

100 "5 

^ Olu V.1 pro Oly Ty. ol« «. .BP "P 

115 



.ys Ly. «a Olu .ys Olu Oly Tyr 01. v.l Ue «u Thr «. 

130 

.sp Arg ASP Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
1« 150 

P.O Olu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 

165 

^ ^ pro Olu 01» Trp v.1 ASP Ph. ^ .1. L.. v.1 Oly «P Pro 

180 

s.r «P «. L.. pro Oly v.l Ly. Oly He oly olu Lys Thr M. 

195 

.ys L.U Ly. Olu Trp Oly 3.r L.U Olu L.U L.U Ly. ^ L.U 

210 

;.sp Arg val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 

225 

Olu ASP Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 



245 250 



P.O I.U Glu val ASP Leu Ala Gin Oly Arg Glu Pro Asp Arg Glu Oly 
260 

x«„ ^ Ph. L.U Olu ^ L.U Olu Ph. Oly S.r L.U L.u Hi. Olu 

275 280 
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Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 

435 . 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 



Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 
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PTC L.U oly 3l« ru «9 «| «a fil 

595 



Ma Leu Val Ala Leu Asp Tyr Ser Oln He Olu Leu Arg Val Leu Ala 
610 

His Leu ser Oly Asp Glu Asn Leu He Arg Val Phe Oln Glu Gly Lys 
625 

;^p lie His Thr Gin Thr Ala Ser Trp Ket Phe Gly val Pro Pro Glu 



645 650 



Ma val ASP pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 

val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 

Ty, .Xu 01. val «. ^ lie =1- ^"^ 

690 

pro Lys val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 



705 



710 



^g Gly Tyr val olu Thr Leu Phe Cly Arg Arg Arg Tyr Val Pro Asp 
725 

.eu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Olu Arg Met Ala 
740 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 

755 ''^^ 
Met val Lys Leu Phe Pro Arg Leu Arg Olu Met Gly Ala Arg Met Leu 
770 '^'^5 



.eu Gin val Ala Asn Olu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 '790 795 

Glu Glu val Ala Ala Leu Ala Lys Glu Ala Met Olu Lys Ala Tyr Pro 
805 

Leu Ala val Pro Leu Olu Val Olu Val Gly Met Oly Olu Asp Trp Leu 



820 



Ser Ala Lys Gly His His His His His His 

840 





835 


<210> 


37 


<211> 


842 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


synthetic 
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<400> 37 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 

115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Ti^p Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Tip Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 ' 280 285 



Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 



108 



PCT/USOl/17086 

WO 01/90337 

290 295 300 



pro Pro Glu Oly Ala Pbe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu ASP Leu val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 



375 380 



370 

ASP Pro ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 



425 



435 



420 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
' 440 



Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 



450 



455 



Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 



Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
^ 580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
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595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp lie His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 

<210> 38 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 38 
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Met se. O.y Met .eu Pro .e. P.e Olu Pro .ys Oly - .eu 
1 s 



. T jila Tvr Arg Thr Phe His Ala Leu hya 
Leu Val ASP Gly His His Leu Ala Tyr Arg t 

20 

. riv Glu pro val Gin Ala Val Tyr Gly Pbe 
Gly Leu Thr Thr Ser Arg Gly Glu Pro va 

T T «u I.V8 Ala Leu Lys Glu Asp Gly Asp Ala Val He 
Ala Lys Ser Leu Leu Lys JU.a ueu j 

50 

v.. ,^ «P - - S= ^ " 

65 '^^ 

0., TV. .y. 01, P.O ... »o =lu «P P.e Pxo 0.. 

85 



^. «a «u n. .X» V.1 «P ^ - «V - «a 

100 

iM Ala Ser Leu Ala Lys 
Glu val pro Gly Tyr Glu Ala Asp Asp Val Leu Ala 

115 

«. ox. 0.U .IV TV- S ^ "° 

130 

Ti^ n4fi Val Leu His Pro Glu 
„p lyr Gin "P «9 "« J" jso 

145 

.IV TV. U. T.. T.P - ^ 5s 

«p 01= T., ^s. TV. - - - ?S 
180 



OIV V.1 .V. 01. .V. f„J «^ - 

195 

^„ 01. 01. T.P oiv s« Le. 01. M. ... ^y; 

210 

.V. p» »« 

225 

.Ve L=. T.P «P - -V= III "° 

245 



,.1 »e 01. P^ «P - 

260 

... 01. - oi. p- f^y - - - 

275 

s„ P„ .v. "a - 01. 01. «a P» TTP pro Pro Pro 01. 



Leu 

290 
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Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 



Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 
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^ s« u. .1. - - - 

T.U lie Arg Val Phe Gin Glu Gly wg Asp He His 
Gly Asp Glu Asn I.eu lie Arg va 

625 "° 



^ ;aa ^ »et O^V V.J P~ - - - 

645 

660 . 
. ;aa His Arg Leu Ser Gin Glu I^u Ala He Pro Tyr Glu 
Gly Met ser Ala His u g^s 

=v. Tie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
Glu Ala Gin Ala Phe He Glu Arg y 
690 

^ ^ 0.. .v= ^ - OX, .r, «S ^ 

-705 

725 

^ s» ... - - - - - "* 

740 

0. - - - " " 

I ... K.. - 

770 

vax ... 0.. ^V. - - - - ro 

I L« «. .V. - - - S - ^" - 

805 

cx« V. .1. vax OX, X.. «v ox. ™ - - 

820 

Glu His His His His His His 
835 

<210> 39 
<211> 

<212> PRT ^ 

<213> Artificxal 

<220> 

<223> synthetic 

1 ^ 
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Leu Val Asp Qly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Qly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 

210 215 220. 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 



Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 
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^ «„ L» «a «a «a «3 olj olv -1 Hi= «| 

325 

^ .V. - ^ ^1 S 

340 

Ala Leu Arg Glu Qly Leu Gly Leu 



Leu Ala Lys Abp Leu Ser Val Leu Aia .eu 
355 



„p pro -et ^. «a -P "° 

Z - val Tyr 01, «. «P 

385 

«a 01, 01. «0 .1. - - - "° 

405 

01. OIV 01. 01. - - - ^ 

420 

o o I..U ser Ala Val Leu Ala His Met Glu Ala Thr 
Glu val Glu Arg Pro Leu Ser Aia v 
435 

0., val 1- «P val «. T,^ ^. «0 - - - 

450 

Ala 



3X. 01. n, ^ «9 01. Ma 01. val Ph. -a «a 01, 

„. - - "p f,; ^ 

485 

«p 01. ... 01, «. "a XI. 01, .ya ^ ^5 

s.. r. «a Ma val ... 01. Ma 1.. ol« - -a «o 

515 

^1 TH,^ Ara Glu Leu Thr Lys Leu Lys Ser 
lie val Glu Lys He Leu Gin Tyr Arg Glu Le 

530 

^ XI. ^ pro - - "p - SI "'^^ 

^"^ 550 
545 

... Hi. Thr ^ Ph. «n 01. Thr Ma Thr Ma Thr 01, ^ - 

565 

a.r «P pro 1.. 01. XI. P- val Thr Pro h.. 01, 
580 

0X„ XX. ^ «3 Ma Ph. Xla Ma 01. 01. 01, TrP - - "al 

595 

«a ... ».P T,r ..r 01. XI. 01. ... vaX Ma Hi. - 

610 
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Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Arg Asp lie Ala 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 



Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 



Arg Val Lys Ser Val Arg Glu Ala 
740 

Pro Val Gin Gly Thr Ala Ala Asp 
755 760 

Leu Phe Pro Arg Leu Glu Glu. Met 
770 775 

Ala Asn Glu Leu Val Leu Glu Ala 
785 790 

Ala Arg Leu Ala Lys Glu Val Met 
805 

Pro Leu Glu Val Glu Val Gly He 
820 



Ala Glu Arg Met Ala Phe Asn Met 
745 750 

Leu Met Lys Leu Ala Met Val Lys 
765 

Gly Ala Arg Met Leu Leu Gin Val 
780 

Pro Lys Glu Arg Ala Glu Ala Val 
795 800 

Glu Gly Val Tyr Pro Leu Ala Val 
810 815 

Gly Glu Asp Trp Leu Ser Ala Lys 
825 830 



Glu His His His His His His 
835 



<210> 40 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 40 



Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
1-5 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
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m - ,.1 01. V.1 Ty. 01, P.e 
L ..u «. .1. - ol„ «P ojy -P "-^ 

50 _ 

val val Phe «P .y= - - ^ "* ^ 

"v TV. .ya -a. .ly - - - » "° 

85 



lie His val Leu His Pro Glu 

150 



ox, Tyr «u Xle P.o - ^ 



^„ ..u lu - -1 «P - =iv - 

ICQ ^^"^ 

V.X P.O 01, ^ ol« .u ^P «P V.1 - - - 

115 

Olu -ys Olu Oly ^ Olu v.1 He ^; 

130 

IVsp Leu Tyr Gin Leu Leu Ser ^^sp Arg He hx. 
145 

Tvr Leu He 

165 

,vi T«„ Thr Glv Asp Olu Ser Asp 
pro ASP Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly 

180 

^-1 TT^ riv Glu Lys Thr Ala Arg Lys Leu 
Asn Leu pro Gly Val Lys Gly lie Gly Glu Ly 

195 

^„ OIU oly se. ..u 01» - - «. - 
210 215 

.y= pro «a lie 01. Ly. U. ^ ^ 

Z - - -p - '•^ S "° 

245 

v.1 «P P.e «. .y. "» 

260 

ox. .e. OIU P- Oly s„ Hi, OX. P.e oly 

275 280 



.X. ... P.O -y. AXa La. OX. OX. .1. Pro T„ Pro P.o Pro OX. 

290 

«. p.. V.X OXy Ph. vax L.. S.r Ly. Olu Pro Hat Trp 

«. AXa Al. Oly OXy Ar, vaX Hi. Arg AXa 



Asp Leu Leu Ala Leu 
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aagggggagg 


agaggcttct 


ttggctttac 


gaggaggtgg 


aaaagcccct 


ttcgcgggtc 


1320 


ctggcccaca 


tggaggccac 


gggggtacgg 


ttggatgtgg 


cctacttaaa 


ggccctttcc 


1380 


ctggaggtgg 


aggcggagat 


aaggcgcttc 


gaggaggagg 


tccaccgcct 


ggccgggcat 


1440 


cctttcaacc 


tgaactcccg 


ggaccagctg 


gaaagggtca 


tctttgacga 


gcttgggctt 


1500 


cccgccatcg 


gcaagacgca 


gaagacgggc 


aagcgctcca 


ccagcgccgc 


cgttttggag 


1560 


gccttgcggg 


aggctcatcc 


catcgtggac 


cgcatccttc 


agtaccggga 


gctttccaag 


1620 


ctcaagggaa 


cctacatcga 


tcccttgcct 


gccctggtcc 


accccaagac 


gaaccgcctc 


1680 


cacacccgtt 


tcaaccagac 


ggccaccgcc 


acggggaggc 


ttagcagctc 


ggatcctaat 


1740 


ctgcaaaata 


tccccgtgcg 


cacccctttg 


ggccagcgga 


tccgccgggc 


cttcgtggcc 


1800 


gaggaggggt 


ggaggctggt 


ggttttggac 


tacagccaga 


ttgagctcag 


ggtcctggcg 


1860 


cacctttccg 


gggacgagaa 


cctaatccgg 


gtcttccagg 


agggccagga 


catccacacc 


1920 


cagacggcca 


gctggatgtt 


cggcgtgccc 


ccagaggccg 


tggattccct 


gatgcgccgg 


1980 


gcggccaaga 


ccatcaactt 


cggcgtcctc 


tacggcatgt 


ccgcccaccg 


gctttcggga 


2040 


gagctggcca 


tcccctacga 


ggaggcggtg 


gccttcatcg 


agcggtattt 


ccagagctac 


2100 


cccaaggtgc 


gggcctggat 


tgagaaaacc 


ctggcggaag 


gacgggaacg 


gggctatgtg 


2160 


gaaaccctct 


ttggccgccg 


gcgctacgtg 


cccgacttgg 


cttcccgggt 


gaagagcatc 


2220 


cgggaggcag 


cggagcgcat 


ggccttcaac 


atgccggtcc 


aqgqgaccgc 


cgcggatttg 


2280 


atgaaactgg 


ccatggtgaa 


gctctttccc 


aggcttcagg 


agctgggggc 


caggatgctt 


2340 


ttgcaggtgc 


acaacgaact 


ggtcctcgag 


gctcccaagg 


agcaagcgga 


ggaagtcgcc 


2400 


caggaggcca 


agcggaccat 


ggaggaggtg 


tggcccctga 


aggtgccctt 


ggaggtggaa 


2460 


9tgggcatcg 


gggaggactg 


gctttccgcc 


aaggcc 






2496 



<210> 348 

<211> 832 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 348 

Met Asn Ser Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 

1 5 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
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^ S.r «, 01, =1U «o v.. .1. v.. «V "a 

ser ^« 1 .v. - »f r ™' 

^ «p Ly, «a pro s,. ^ ^ IT 



Pro 



Oly «. »o -mr pro OXu ..P Ph. Pro ^ «~ 

n. I.y. 01. 1 val «P Leu Leu oly Leu olu ^ Leu olu v,l 
100 

.r 1 Toil Ala Thr Leu Ala Lys Lys Ala 
Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Le 

1« w 

lie Leu Thr Ala Aep Arg Asp Leu 



115 



Olu Lys Glu Gly Tyr Glu Val Arg He ueu 
130 

^ L.u L». s.r Olu n, ser U. Leu Hi. Pro Olu oly Tyr 



Xle Thr pro Olu Txp Leu Trp Olu Lye Tyr oly Leu Lys Pro a.r 

165 

el. Trp V.1 «P Tyr «a L.u Oly «P Pro -r «P n. 

180 -^^^ 

^ al, V.1 Ly. Oly 11. oly ol« Ly. Thr «e Ma Lye Leu XI. «0 

195 

Olu TTP Oly s.r L.U Olu ^ Leu Ly. .i. L.u olu 01. val Ly. 

210 215 

P„ »!, ser val olu Ly. XI. L.U Ser Hi. ».t olu «P Leu Ly. 

225 

:,u s.r L.U 01. Leu s.r Val Bi. Thr «p L.. Leu L.u 01» v.1 

„P P.. Ma "9 Olu pro A.P «9 olu oly L.U Ly. «. Ph. 

..u OIU «3 'I Olu Ph. Oly S.r Leu Leu Hi. Olu Phe oly Leu ^ 
275 

. V.1 Ala Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 
Glu Ser Pro Val Ala Axa oxu « 

290 

„. P.. v.1 oly Tyr val Leu Ser Pro olu Pro ».t Trp M. olu 
305 "° 



Leu Asn 



.la Leu Ala Ala Ala Trp Gly Gly Arg Val Tyr Arg Ala Glu 
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325 



330 



335 



Asp Pro Leu Glu Ala Leu Arg Gly Leu Gly Glu Val Arg Gly Leu Leu 
340 345 350 

Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly lie Ala Leu Ala 
355 360 365 

Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 
370 375 380 

Thr Ala Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu 
385 390 395 400 

Glu Ala Gly Glu Arg Ala Leu Leu Ser Glu Arg Leu Tyr Ala Ala Leu 
405 410 415 

Leu Lys Arg Leu Lys Gly Glu Glu Arg Leu Leu Trp Leu Tyr Glu Glu 
420 425 430 

Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr Gly 
435 440 445 

Val Arg Leu Asp Val Ala Tyr Leu Lys Ala Leu Ser Leu Glu Val Glu 
450 455 460 

Ala Glu He Arg Arg Phe Glu Glu Glu Val His Arg Leu Ala Gly His 
465 470 475 480 

Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val lie Phe Asp 
485 490 495 

Glu Leu Gly Leu Pro Ala He Gly Lys Thr Gin Lys Thr Gly Lys Arg 
500 505 510 

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro He 
515 520 525 

Val Asp Arg He Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys Gly Thr 
530 535 540 

Tyr He Asp Pro Leu Pro Ala Leu Val His Pro Lys Thr Asn Arg Leu 
545 550 555 560 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 570 575 

Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin 
580 585 590 

Arg He Arg Arg Ala Phe Val Ala Glu Glu Gly Trp Arg Leu Val Val 
595 600 605 

Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 615 620 



Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Gin Asp He His Thr 



120 



PCT/USOl/17086 

WO 01/90337 



625 



630 



635 



640 



01. Thr Ala ser .r. Met Pha oly val Pro Pro olu «. val ».P ser 

645 

«„ H=t «, 1. .ys m U. Ph. «y V.1 LO. Tyr oly 

660 

«t S.r Ma Hi. Ar, »u s.r Gly Glu Ma U, Pro ^ 31. =1« 

675 

val Ala Phe He Glu Arg Tyr Phe Oln Ser Tyr Pro Lys Val Arg 
690 

;aa Trp He Glu Lys O^r Leu Ala Glu Oly Arg Glu Arg Gly Tyr Val 
705 7" ^" 

Olu Thr .eu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Ala Ser Arg 
725 

val Lys ser He Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 
740 

val 01. Oly Tto «. Al. A3P Leu «et Lye Le. Al. Met val Ly. Leu 

755 

Phe pro Arg Leu Gin Glu Leu Oly Ala Arg Met Leu Leu Gin Val His 

770 '^'^^ 
Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Gin Ala Glu Glu Val Ala 
785 •'SO 

om Glu Ala Lys Arg Thr Met Glu Glu Val Trp Pro Leu Lys Val Pro 
805 

.eu Glu val Glu val Gly He Gly Glu Asp Trp Leu Ser Ala Lys Ala 



820 



<210> 349 

<211> 2526 

<212> DNA 

<213> Artificial 

<220> 

<223> synthetic 



ftga^ttcca ccccactttt tgacctggag gaacccccca agcgggtgct tctggtggac 
ggccaccacc tggcctaccg caccttctat gccctgagcc tcaccacctc ccggggggag 
ccggtgcaga tggtctacgg cttcgcccgg agcctcctca aggccttgaa ggaggacgga 
caggcggtgg tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacgag 
gcctacaagg cgggccgggc ccccaccccg gaggacttcc cccgccagct cgccttggtc 300 



60 
120 
180 
240 
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aagcggctgg 


tggaccttct 


gggctttacc 


cgcctcgagg 


ccccggggta 


cgaggcggac 


360 


gacgtcctgg 


gcaccctggc 


caagaaggcc 


gaaagggagg 


ggatggaggt 


gcgcatcctc 


420 


acgggagacc 


gggacttctt 


ccagctcctc 


tccgagaagg 


tctcggtcct 


cctgccggac 


480 


gggaccctgg 


tcaccccaaa 


ggacgtccag 


gagaagtacg 


gggtgccccc 


ggagcgctgg 


540 


gtggacttcc 


gcgccctcac 


g99ggaccgc 


tcggacaaca 


tccccggggt 


ggcggggata 


600 


ggggagaaga 


ccgcccttcg 


actcctcgca 


gagtggggga 


gcgtggaaaa 


cctcctgaag 


660 


aacctggacc 


gggtaaagcc 


ggactcgctc 


cggcgcaaga 


tagaggcgca 


cctcgaggac 


720 


ctccacctct 


ccttagacct 


ggcccgcatc 


cgcaccgacc 


tccccctgga 


ggtggacttt 


780 


aaggccctgc 


gccgcaggac 


ccccgacctg 


gagggcctga 


gggccttttt 


ggaggagctg 


840 


gagttcggaa 


gcctcctcca 


cgagttcggc 


ctcctgggag 


gggagaagcc 


ccgggaggag 


900 


gccccctggc 


ccccgcccga 


aggggccttc 


gtgggcttcc 


tcctttcccg 


caaggagccc 


960 


atgtgggcgg 


agcttctggc 


cctggcggcg 


gcctcgggcg 


gccgcgtcca 


ccgggcaaca 


1020 


agcccggttg 


aggccctggc 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


gagcttaggc 


ttcccgcctt 


gaagaagacg 


aagaagacag 


gcaagcgctc 


caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag 


gcttagtagc 


1740 


tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


tgggccagag 


gatccgccgg 


1800 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg 


gtggccctgg 


actatagcca 


gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca 


gggtcttcca 


ggaggggaag 


1920 


gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc 


cgtggacccc 


1980 


ctgatgcgcc 


gggcggccaa 


gacggtgaac 


ttcggcgtcc 


tctacggcat 


gtccgcccat 


2040 



122 



wo 01/90337 

aggctctccG 
ttccaaagct 
cggggctacg 
gtgaagagcg 
gccgccgacc 
gcccgcatgc 
gaggaggtgg 
ctggaggtgg 
caccac 



aggagcttgc 
tccccaaggt 
tggaaaccct 
tcagggaggc 
tcatgaagct 
tcctccaggt 
cggctttggc 
aggtggggat 



catcccctac 
gcgggcctgg 
cttcggaaga 
cgcggagcgc 
cgccatggtg 
cgccaacgag 
caaggaggcc 
gggggaggac 



gaggaggcgg 
atagaaaaga 
aggcgctacg 
atggccttca 
aagctcttcc 
ctcctcctgg 
atggagaagg 
tggctttccg 



tggcctttat 
ccctggagga 
tgcccgacct 
acatgcccgt 
cccgcctccg 
aggcccccca 
cctatcccct 
ccaagggtca 



PCT/USOl/17086 

agagcgctac 2100 

ggggaggaag 2160 

caacgcccgg 2220 

ccagggcacc 2280 

ggagatgggg 2340 

agcgcgggcc 24 oo 
cgccgtgccc 2460 
ccaccaccac 2520 
2526 



<210> 350 

<211> 2505 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 350 

atgaattcca ccccactttt 
ggccaccacc tggcctaccg 
ccggtgcaga tggtctacgg 
caggcggtgg tcgtggtctt 
gcctacaagg cgggccgggc 
aagcggctgg tggaccttct 
gacgtcctgg gcaccctggc 
acgggagacc gggacttctt 
gggaccctgg tcaccccaaa 
gtggacttcc gcgccctcac 
ggggagaaga ccgcccttcg 
aacctggacc gggtaaagcc 
ctccacctct ccttagacct 
aaggccctgc gccgcaggac 



tgacctggag 
caccttctat 
cttcgcccgg 
tgacgccaag 
ccccaccccg 
gggctttacc 
caagaaggcc 
ccagctcctc 
ggacgtccag 
gggggaccgc 
actcctcgca 
ggactcgctc 
ggcccgcatc 
ccccgacctg 



gaacccccca 
gccctgagcc 
agcctcctca 
gccccctcct 
gaggacttcc 
cgcctcgagg 
gaaagggagg 
tccgagaagg 
gagaagtacg 
tcggacaaca 
gagtggggga 
cggcgcaaga 
cgcaccgacc 
gagggcctga 



agcgggtgct 
tcaccacctc 
aggccttgaa 
tccgccacga 
cccgccagct 
ccccggggta 
ggatggaggt 
tctcggtcct 
gggtgccccc 
tccccggggt 
gcgtggaaaa 
tagaggcgca 
tccccctgga 

gggCCttttt 



tctggtggac 
ccggggggag 
ggaggacgga 
ggcctacgag 
cgccttggtc 
cgaggcggac 
gcgcatcctc 
cctgccggac 
ggagcgctgg 
ggcggggata 
cctcctgaag 
cctcgaggac 
ggtggacttt 
ggaggagctg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
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gagttcggaa 


gcctcctcca 


cgagttcggc 


ctcctgggag 


gggagaagcc ccgggaggag 


900 


gccccctggc 


ccccgcccga 


aggggccttc 


gtgggcttcc 


tcctttcccg caaggagccc 


960 


atgtgggcgg 


agcttctggc 


cctggcggcg 


gcctcgggcg 


gccgcgtcca ccgggcaaca 


1020 


agcccggttg 


aggccctggc 


cgacctcaag 


gaggcccggg 


ggttcctggc caaggacctg 


1080 


gccgttttgg 


ccctgcggga 


gggggtggcc 


ctggacccca 


cggacgaccc cctcctggtg 


1140 


gcctacctcc 


tggacccggc 


caacacccac 


cccgaggggg 


tggcccggcg ctacgggggc 


1200 


gagttcacgg 


aggacgcagc 


ggagagggcc 


ctcctctccg 


agaggctctt ccagaacctc 


1260 


tttaaacggc 


tttccgagaa 


gctcctctgg 


ctctaccagg 


aggtggagcg gcccctctcc 


1320 


cgggtcttgg 


cccacatgga 


ggcccggggg 


gtgaggctgg 


acgtccccct tctggaggcc 


1380 


ctctcctttg 


agctggagaa 


ggagatggag 


cgcctggagg 


gggaggtctt ccgtttggcc 


1440 


ggccacccct 


tcaacctcaa 


ctcccgcgac 


cagctggaaa 


gggtcctctt tgacgagctg 


1500 


ggcctcaccc 


cggtgggccg 


gacgcagaag 


acgggcaagc 


gctccaccgc ccagggggcc 


1560 


ctggaggccc 


tccggggggc 


ccaccccatc 


gtggagctca 


tcctccagta ccgggagctt 


1620 


tccaagctca 


aaagcaccta 


cctggacccc 


ctgccccggc 


tcgtccaccc gcggacgggc 


1680 


cggctccaca 


cccgcttcaa 


ccagacggcc 


acggccacgg 


gaaggctttc cagctccgac 


1740 


cccaacctgc 


agaacatccc 


cgtgcgcacc 


cccttggggc 


agcgcatccg caaggccttc 


1800 


gtggccgagg 


aggggtggct 


ccttttggcg 


gcggactact 


cccagattga gctccgggtc 


1860 


ctggcccacc 


tctcggggga 


cgagaacctg 


aagcgggtct 


tccgggaggg gaaggacatc 


1920 


cataccgaga 


ccgccgcctg 


gatgttcggc 


ttagaccccg 


ctctggtgga tccaaagatg 


1980 


cgccgggcgg 


ccaagacggt 


caacttcggc 


gtcctctacg 


ggatgtccgc ccacaggctc 


2040 


tcccaggagc 


tcggcataga 


ctacaaggag 


gcggaggcct 


ttattgagcg ctacttccag 


2100 


agcttcccca 


aggtgcgggc 


ctggatagaa 


aggaccctgg 


aggagggccg gacgcggggc 


2160 


tacgtggaga 


ccctgttcgg 


caggaggcgc 


tatgtgcccg 


acctggcctc ccgggtccgc 


2220 


ucggcgcggg 


^gy^-sscgga 


gcggacggcc 


uucaacaugc 


ccgugcaggg caccgccgcc 




gacctgatga 


agatcgccat 


ggtcaagctc 


ttccccaggc 


taaagcccct gggggcccac 


2340 


ctcctcctcc 


aagtgcacaa 


cgagctggtc 


ctggaggtgc 


ccgaggaccg ggccgaggag 


2400 


gccaaggccc 


tggtcaagga 


ggtcatggag 


aacgcctacc 


ccctggacgt gcccctcgag 


2460 


gtggaggtgg 


gcgtgggtcg 


ggactggctg 


gaggcgaagc 


aggat 


2505 
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<210> 351 

<211> 835 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 351 

s„ ™. «o «u ..e OJ. =X« - - - 

val «P L »i. »i» IT 
^^r^s.r^ 0., oju va. H.. vax ..V ... 

1 -V - - "° " " " 

50 " 
va. v.. ..a ^a »U S« - - 

1 - - - ~ r "° - 5r 

^. va. lv= - va. - -V - - - 

100 

, . val Leu Gly Thr Leu Ala Lys 

Glu Ala pro Gly Tyr Qlu Ala Asp Asp Val 
115 "° 

, « .- f.i„ val Arg lie Leu Tbr Qly Asp Arg 
Lys Ala Glu Arg Glu Gly Met Glu Val Arg 

130 

^ P., 01. «u ..u =XU .ya val .a. val ..u »o ^^P 

145 

.1, ™, val «o "P val 01. 01. .va ^ val P.O 

^ 165 

^ * AT a T eu Thr Gly Asp Arg Ser Asp 
pro Glu Arg Trp Val Asp Phe Arg Ala Leu Thr Giy 

180 

Ua «o .1, val .a 01, Xla 01, 01« - - - - 

195 

, . riv ser Val Glu Asn Leu Leu Lys Asn Leu Asp Arg 
Leu Ala Glu Trp Gly Ser vax oj-u ^20 
210 ^" 

val -«P - - - - - 

22 5 

Hla sa. L.U «P La. Ma 11. ^ 

245 
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Glu Val Asp Phe Lys Ala Leu Arg Arg Arg Thr Pro Asp Leu Glu Gly 
260 265 270 

lieu Arg Ala Phe Leu Glu Glu Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu Ser Arg Lys Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Leu Ala Leu Ala Ala Ala Ser Gly Gly Arg Val 
325 330 335 

His Arg Ala Thr Ser Pro Val Glu Ala Leu Ala Asp Leu Lys Glu Ala 

340 345 350 

Arg Gly Phe Leu Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly 
355 360 365 

Val Ala Leu Asp Pro Thr Asp Asp Pro Leu Leu Val Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ala Asn Thr His Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Phe Thr Glu Asp Ala Ala Glu Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

Phe Gin Asn Leu Phe Lys Arg Leu Ser Glu Lys Leu Leu Trp Leu Tyr 
420 425 430 

Gin Glu Val Glu Arg Pro Leu Ser Arg Val Leu Ala His Met Glu Ala 
435 440 445 

Arg Gly Val Arg Leu Asp Val Pro Leu Leu Glu Ala Leu Ser Phe Glu 
450 455 460 

Leu Glu Lys Glu Met Glu Arg Leu Glu Gly Glu Val Phe Arg Leu Ala 
465 470 475 480 

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 
485 490 495 

Phe Asp Glu Leu Gly Leu Thr Pro Val Gly Arg Thr Gin Lys Thr Gly 
500 505 510 

Lys Arg Ser Thr Ala Gin Gly Ala Leu Glu Ala Leu Arg Gly Ala His 
515 520 525 

Pro lie Val Glu Leu He Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys 
530 535 540 



Ser Thr Tyr Leu Asp Pro Leu Pro Arg Leu Val His Pro Arg Thr Gly 
545 550 555 560 
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^ I.U His Thr Arg Phe Asn Gin Th. Ala Thr Ma Thr Oly Arg Leu 

565 570 
ser ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu 
580 

«y Gl„ AT, Ly. Ala .J. val «a 01„ «u 31y Trp Leu 



595 

Leu Ala Ala Asp Tyr Ser 
610 



Gin He Glu Leu Arg Val Leu Ala His Leu 



615 



620 



ser Gly Aap Glu Asn Leu Lys Arg Val Phe Arg Glu Gly Lys Asp lie 
625 "° 

His Thr Glu Thr Ala Ala Trp Met Phe Gly Leu Asp Pro Ala Leu Val 
645 

pro Lys Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu 

660 

^ Oly Met se. Ma Hi. Olu ^ fV "« ^ ^' 

675 

^ys Glu Ala Glu Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys 
690 

val Arg Ala Trp He Glu Arg Thr Leu Glu Glu Gly Arg Thr Arg Gly 
705 "^^^ 

Tyr val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Ala 
725 

ser Arg Val Arg Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
740 

Met pro val Gin Gly Thr Ala Ala Asp Leu Met Lys lie Ala Met Val 



755 



760 



.ys Leu Phe Pro Arg Leu Lys Pro Leu Gly Ala His Leu Leu Leu Gin 
770 '^'^^ 



Val His Asn Glu Leu 
785 



val Leu Glu Val Pro Glu Asp Arg Ala Glu Glu 



790 



795 



Ala Lys Ala Leu Val Lys Glu Val Met Glu Asn Ala Tyr Pro Leu Asp 



805 



Val Pro Leu 



Glu val Glu val Gly Val Gly Arg Asp Trp Leu Glu Ala 



820 



825 



Lys Gin Asp 
835 

<210> 352 
<211> 2496 
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<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 352 



atgaattccc 


tgcccctctt 


tgagcccaag 


ggccgggtgc 


ttctggtgga 


cggccaccac 


60 


ctggcctacc 


gtaccttttt 


tgccctgaag 


ggcctcacca 


ccagccgcgg 


ggagccggtc 


120 


caggcggtgt 


acgggtttgc 


caagagcctt 


ttgaaggcgc 


taagggaaga 


cggggatgtg 


180 


gtgatcgtgg 


tgtttgacgc 


caaggccccc 


tccttccgcc 


accagaccta 


cgaggcctac 


240 


aaggcggggc 


gggctcccac 


ccccgaggac 


tttccccggc 


agcttgccct 


tatcaaggag 


300 


atggtggacc 


ttttgggctt 


tacccgcctc 


gaggtgccgg 


gctttgaagc 


ggatgacgtc 


360 


ctggctaccc 


tggccaagaa 


ggcggaaaag 


gaaggctacg 


aagtgcgcat 


cctcaccgcg 


420 


gaccgggacc 


tttaccagct 


tctttcggag 


cgaatctcca 


tccttcaccc 


ggagggttac 


480 


ctgatcaccc 


cggagtggct 


ttgggagaag 


tatgggctta 


agccttccca 


gtgggtggac 


540 


taccgggcct 


tggccgggga 


cccttccgac 


aacatccccg 


gcgtgaaggg 


catcggggag 


600 


aagacggcgg 


ccaagctgat 


ccgggagtgg 


ggaagcctgg 


aaaaccttct 


taagcacctg 


660 


gaacaggtga 


aacctgcctc 


cgtgcgggag 


aagatcctta 


gccacatgga 


ggacctcaag 


720 


ctatccctgg 


agctatcccg 


ggtgcacacg 


gacttgctcc 


ttcaggtgga 


cttcgcccgg 


780 


cgccgggagc 


cggaccggga 


ggggcttaag 


gcctttttgg 


agaggctgga 


gttcggaagc 


840 


ctcctccacg 


agttcggcct 


gttggaaagc 


ccggtggcgg 


cggaggaagc 


tccctggccg 


900 


ccccccgagg 


gagccttcgt 


sgggtacgtt 


ctttcccgcc 


ccgagcccat 


gtgggcggag 


960 


cttaacgcct 


tggccgccgc 


ctggggcggc 


cgcgtttacc 


gggcggagga 


tcccttggag 


1020 


gccttgcggg 


ggcttgggga 


ggtgaggggg 


cttttggcca 


aggacctggc 


ggtgctggcc 


1080 


ctgagggaag 


ggattgccct 


ggcaccgggc 


gacgacccca 


tgctcctcgc 


ctacctcctg 


1140 


gatccttcca 


acaccgcccc 


cgaaggggta 


gcccggcgct 


acggggggga 


gtggaccgag 


1200 


gaggcggggg 


aaagggcgct 


gctttccgaa 


aggctttacg 


ccgccctcct 


gaagcggctt 


1260 


aagggggagg 


agaggcttct 


ttggctttac 


gaggaggtgg 


aaaagcccct 


ttcgcgggtc 


1320 


ctggcccaca 


tggaggccac 


gggggtacgg 


ttggatgtgg 


cctacttaaa 


ggccctttcc 


1380 


ctggaggtgg 


aggcggagat 


aaggcgcttc 


gaggaggagg 


tccaccgcct 


ggccgggcat 


1440 


cctttcaacc 


tgaactcccg 


ggaccagctg 


gaaagggtca 


tctttgacga 


gcttgggctt 


1500 
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1740 
1800 
1860 
1920 



cccgccatcg gcaagacgca gaagacgggc aagcgctcca ccagcgccgc cgttttggag 1560 

gccttgcggg aggctcatcc catcgtggac cgcatccttc agtaocggga gctttccaag 1620 

ctcaagggaa cctacatcga tcccttgcct gccctggtcc accccaagac gaaccgcctc 1680 
cacacccgtt tcaaccagac ggccaccgcc acggggaggc ttagcagctc ggatcctaat 
ctgcaaaata tccccgtgcg cacccctttg ggccagcgga tccgccgggc cttcgtggcc 
gaggaggggt ggaggctggt ggttttggac tacagccaga ttgagctcag ggtcctggcg 
cacctttccg gggacgagaa cctaatccgg gtcttccagg agggccagga catccacacc 

cagacggcca gctggatgtt cggcgtgccc ccagaggccg tggattccct gatgcgccgg 1980 

gcggccaaga ccatcaactt cggcgtcctc tacggcatgt ccgcccaccg gctttcggga 2040 
gagctggcca tcccctacga ggaggcggtg gccttcatcg agcggtattt ccagagctac 2100 
cccaaggtgc gggcctggat tgagaaaacc ctggcggaag gacgggaacg gggctatgtg 2160 
gaaaccctct ttggccgccg gcgctacgtg cccgacttgg cttcccgggt gaagagcatc 2220 
cgggaggcag cggagcgcat ggccttcaac atgccggtcc aggggaccgc cgcggatttg 2280 
atgaaactgg ccatggtgaa gctctttccc aggcttcagg agctgggggc caggatgctt 2340 
ttgcaggtgc acaacgaact ggtcctcgag gctcccaagg agcaagcgga ggaagtcgcc 2400 
caggaggcca agcggaccat ggaggaggtg tggcccctga aggtgccctt ggaggtggaa 2460 
gtgggcatcg gggaggactg gctttccgcc aaggoc 

<210> 353 

<:211> 832 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 353 

Met Asn Ser Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
1 5 1° 



ASP Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 

20 25 

Thr Thr ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val He Val Val 



2496 
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Phe A?p Ala hys Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu lie Lys Glu Met Val Asp Leu Leu Gly Phe Thr Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Arg Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Glu Arg lie Ser He Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu Lys Pro Ser 
165 170 175 

Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser Asp Asn He 
180 185 190 

Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Ala Lys Leu He Arg 
195 200 205 

Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys His Leu Glu Gin Val Lys 
210 215 220 

Pro Ala Ser Val Arg Glu Lys He Leu Ser His Met Glu Asp Leu Lys 
225 230 235 240 

Leu Ser Leu Glu Leu Ser Arg Val His Thr Asp Leu Leu Leu Gin Val 
245 250 255 

Asp Phe Ala Arg Arg Arg Glu Pro Asp Arg Glu Gly Leu Lys Ala Phe 
260 265 270 

Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu 
275 280 285 

Glu Ser Pro Val Ala Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 
290 295 300 

Ala Phe Val Gly Tyr Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu 
305 310 315 320 

Leu Asn Ala Leu Ala Ala Ala Trp Gly Gly Arg Val Tyr Arg Ala Glu 
325 330 335 

Asp Pro Leu Glu Ala Leu Arg Gly Leu Gly Glu Val Arg Gly Leu Leu 
340 345 350 

Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly He Ala Leu Ala 
355 360 365 
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pro Gly ASP Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 

370 ^"^5 
Thr Ala pro Glu Gly Val Ala Arg Arg Tyr Gly Oly Glu Trp Thr Glu 
385 "0 395 

Glu Ala Gly Glu Arg Ala Leu Leu Ser Glu Arg Leu Tyr Ala Ala Leu 
405 

.eu Lys Arg Leu Lys Gly Glu Glu Arg Leu Leu Trp Leu Tyr Glu Glu 
420 

val Glu Lys pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr Gly 

val Arg Leu Asp val Ala Tyr Leu Lys Ala Leu Ser Leu Glu Val Glu 
450 

Ma Glu lie Arg Arg Phe Glu Glu Glu Val His Arg Leu Ala Gly His 
465 ^•'O 475 

pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val He Phe Asp 
485 

Olu Leu Gly Leu Pro Ala He Gly Lys «.r Glu Lys Thr Gly Lys Arg 
500 

ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro He 
515 

val ASP Arg He Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys Gly Thr 

530 

Tyr He Asp Pro Leu Pro Ala Leu val His Pro Lys Thr Asn Arg Leu 
545 550 . 555 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 

ser ASP pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin 

580 585 
Arg He Arg Arg Ala Phe Val Ala Glu Glu Gly Trp Arg Leu Val Val 
595 

.eu ASP Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 

ASP Glu Asn Leu He Arg Val Phe Gin Glu Gly Gin Asp He His Thr 

625 630 

aln Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp Ser 
645 

.eu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr Gly 
660 
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Met Ser Ala His Arg Leu Ser Gly Glu Leu Ala He Pro Tyr Glu Glu 
675 660 685 

Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Tyr Pro Lys Val Arg 
690 695 700 

Ala Trp He Glu Lys Thr Leu Ala Glu Gly Arg Glu Arg Gly Tyr Val 
705 710 715 720 

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Ala Ser Arg 
725 730 735 

Val Lys Ser He Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 
740 745 750 

Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu 
755 760 765 

Phe Pro Arg Leu Gin Glu Leu Gly Ala Arg Met Leu Leu Gin Val His 
770 775 780 

Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Gin Ala Glu Glu Val Ala 
785 790 795 800 

Gin Glu Ala Lys Arg Thr Met Glu Glu Val Trp Pro Leu Lys Val Pro 
805 810 . 815 

Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys Ala 
820 825 830 



<210> 


354 


<211> 


42 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


354 



ggcctcaccc cggtgaagcg gacgaagaag acgggcaagc gc 42 



<210> 


355 


<211> 


42 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


355 



gcgcttgccc gtcttcttcg tccgcttcac cggggtgagg cc 42 



<210> 356 
<211> 33 
<212> DNA 
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<213> Artificial 
<220> 

<223> Synthetic 

<400> 356 33 
ctcctcctcc aagtggccaa cgagctggtc ctg 



<210> 357 

<211> 33 

<2X2> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 357 33 
caggaccagc tcgttggcca cttggaggag gag 

<210> 358 

<211> 2505 

<212> DNA 

<213> Artificial 

<220> 

<223> synthetic 



^tga^ttcca ccccactttt tgacctggag gaacccccca agcgggtgct tctggtggac 
ggccaccacc tggcctaccg caccttctat gccctgagcc tcaccacctc ccggggggag 
ccggtgcaga tggtctacgg cttcgcccgg agcctcctca aggccttgaa ggaggacgga 
caggcggtgg tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacgag 
gcctacaagg cgggccgggc ccccaccccg gaggacttcc cccgccagct cgccttggtc 
aagcggctgg tggaccttct gggcctggtc cgcctcgagg ccccggggta cgaggcggac 
gacgtcctgg gcaccctggc caagaaggcc gaaagggagg ggatggaggt gcgcatcctc 
acgggagacc gggacttctt ccagctcctc tccgagaagg tctcggtcct cctgccggac 
gggaccctgg tcaccccaaa ggacgtccag gagaagtacg gggtgccccc ggagcgctgg 
gtggacttcc gcgccctcac gggggaccgc tcggacaaoa tccccggggt ggcggggata 
ggggagaaga ccgcccttcg actcctcgca gagtggggga gcgtggaaaa cctcctgaag 
aacctggacc gggtaaagcc ggactcgctc cggcgcaaga tagaggcgca cctcgaggac 
ctccacctct ccttagacct ggcccgcatc cgcaccgacc tccccctgga ggtggacttt 
aaggccctgc gccgcaggac ccccgacctg gagggcctga gggccttttt ggaggagctg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
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gagttcggaa 


gcctcctcca 


cgagttcggc 


ctcctgggag 


gggagaagcc ccgggaggag 


900 


gccccctcfgc 


ccccgcccga 


aggggccttc 


gtgggcttcc 


tcctttcccg caaggagccc 


960 


atgtgggcgg 


agcttctggc 


cctggcggcg 


gcctcgggcg 


gccgcgtcca ccgggcaaca 


1020 


agcccggttg 


aggccctggc 


cgacctcaag 


gaggcccggg 


ggttcctggc caaggacctg 


1080 


gccgttttgg 


ccctgcggga 


gggggtggcc 


ctggacccca 


cggacgaccc cctcctggtg 


1140 


gcctacctcc 


tggacccggc 


caacacccac 


cccgaggggg 


tggcccggcg ctacgggggc 


1200 


gagttcacgg 


aggacgcagc 


ggagagggcc 


ctcctctccg 


agaggctctt ccagaacctc 


1260 


tttaaacggc 


tttccgagaa 


gctcctctgg 


ctctaccagg 


aggtggagcg gcccctctcc 


1320 


cgggtcttgg 


cccacatgga 


ggcccggggg 


gtgaggctgg 


acgtccccct tctggaggcc 


1380 


ctctcctttg 


agctggagaa 


ggagatggag 


cgcctggagg 


gggaggtctt ccgtttggcc 


1440 


ggccacccct 


tcaacctcaa 


ctcccgcgac 


cagctggaaa 


gggtcctctt tgacgagctg 


1500 


ggcctcaccc 


cggtgaagcg 


gacgaagaag 


acgggcaagc 


gctccaccgc ccagggggcc 


1560 


ctggaggccc 


tccggggggc 


ccaccccatc 


gtggagctca 


tcctccagta ccgggagctt 


1620 


tccaagctca 


aaagcaccta 


cctggacccc 


ctgccccggc 


tcgtccaccc gcggacgggc 


1680 


cggctccaca 


cccgcttcaa 


ccagacggcc 


acggccacgg 


gaaggctttc cagctccgac 


1740 


cccaacctgc 


agaacatccc 


cgtgcgcacc 


cccttggggc 


agcgcatccg caaggccttc 


1800 


gtggccgagg 


aggggtggct 


ccttttggcg 


gcggactact 


cccagattga gctccgggtc 


1860 


ctggcccacc 


tctcggggga 


cgagaacctg 


aagcgggtct 


tccgggaggg gaaggacatc 


1920 


cataccgaga 


ccgccgcctg 


gatgttcggc 


ttagaccccg 


ctctggtgga tccaaagatg 


1980 


cgccgggcgg 


ccaagacggt 


caacttcggc 


gtcctctacg 


ggatgtccgc ccacaggctc 


2040 


tcccaggagc 


tcggcataga 


ctacaaggag 


gcggaggcct 


ttattgagcg ctacttccag 


2100 


agcttcccca 


aggtgcgggc 


ctggatagaa 


aggaccctgg 


aggagggccg gacgcggggc 


2160 


tacgtggaga 


ccctgttcgg 


caggaggcgc 


tatgtgcccg 


acctggcctc ccgggtccgc 


2220 


tcggtgcggg 


aggcggcgga 


gcggatggcc 


ttcaacatgc 


ccgtgcaggg caccgccgcc 


2280 


gacctgatga 


agatcgccat 


ggtcaagctc 


ttccGcaggc 


taaagcccct gggggcccac 


2340 


ctcctcctcc 


aagtggccaa 


cgagctggtc 


ctggaggtgc 


ccgaggaccg ggccgaggag 


2400 


gccaaggccc 


tggtcaagga 


ggtcatggag 


aacgcctacc 


ccctggacgt gcccctcgag 


2460 


gtggaggtgg 


gcgtgggtcg 


ggactggctg 


gaggcgaagc 


aggat 


2505 
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<210> 359 

<211> 835 

<212> PRT 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 359 



Met Asn Ser Thr Pro Leu Phe Asp hen Glu Glu Pro Pro Lys Arg Val 

5 10 . 

Leu val Asp Qly His His Leu Ala Tyr Arg Thr Phe Tyr Ala Leu 
20 25 30 



1 



Leu 



ser Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Met Val Tyr Gly Phe 
35 40 45 

Ala Arg Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Gin Ala Val Val 
50 55 60 

val val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 
65 ■'O "^5 

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 .9° ^ 

Leu Ala Leu Val Lys Arg Leu Val Asp Leu Leu Gly Leu Val Arg Leu 
100 105 110 

Glu Ala Pro Gly Tyr Glu Ala Asp Asp Val Leu Gly Thr Leu Ala Lys 

115 120 125 

Lys Ala Glu Arg Glu Gly Met Glu Val Arg He Leu Thr Gly Asp Arg 
130 135 "0 

ASP Phe Phe Gin Leu Leu Ser Glu Lys Val Ser Val Leu Leu Pro Asp 
14i 150 155 "0 

Gly Thr Leu Val Thr Pro Lys Asp Val Gin Glu Lys Tyr Gly Val Pro 
165 170 1^5 

Pro Glu Arg Trp Val Asp Phe Arg Ala Leu Thr Gly Asp Arg Ser Asp 
180 185 "0 

Asn He pro Gly Val Ala Gly He Gly Glu Lys Thr Ma Leu Arg Leu 
195 200 205 

Leu Ala Glu Trp Qly Ser Val Glu Asn Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

val Lys pro Asp Ser Leu Arg Arg Lys He Glu Ala His Leu Glu Asp 
Leu His Leu Ser Leu Asp Leu Ala Arg He Arg Thr Asp Leu Pro Leu 
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245 



250 



255 



Glu Val Asp Phe Lys Ala Leu Arg Arg Arg Thr Pro Asp Leu Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Glu Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu Ser Arg Lys Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Leu Ala Leu Ala Ala Ala Ser Gly Gly Arg Val 
325 330 335 

His Arg Ala Thr Ser Pro Val Glu Ala Leu Ala Asp Leu Lys Glu Ala 
340 345 350 

Arg Gly Phe Leu Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly 
355 360 365 ' 

Val Ala Leu Asp Pro Thr Asp Asp Pro Leu Leu Val Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ala Asn Thr His Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Phe Thr Glu Asp Ala Ala Glu Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

Phe Gin Asn Leu Phe Lys Arg Leu Ser Glu Lys Leu Leu Trp Leu Tyr 
420 425 430 

Gin Glu Val Glu Arg Pro Leu Ser Arg Val Leu Ala His Met Glu Ala 
435 440 445 

Arg Gly Val Arg Leu Asp Val Pro Leu Leu Glu Ala Leu Ser Phe Glu 
450 455 460 

Leu Glu Lys Glu Met Glu Arg Leu Glu Gly Glu Val Phe Arg Leu Ala 
465 470 475 480 

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 
485 490 495 

Phe Asp Glu Leu Gly Leu Thr Pro Val Lys Arg Thr Lys Lys Thr Gly 
500 505 510 

Lys Arg Ser Thr Ala Gin Gly Ala Leu Glu Ala Leu Arg Gly Ala His 
515 520 525 

Pro lie Val Glu Leu lie Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys 
530 535 540 



Ser Thr Tyr Leji Asp Pro Leu Pro Arg Leu Val His Pro Arg Thr Gly 
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555 "° 

545 «° 

.eu His xnr P.e Asn OXn ^ .la OXy -| - 

Ser Ser Ser ^ ^ ;.n .eu .Xn .sn Xle Pro Val - .eu 

580 ^^'^ 

01, 01. n, »s .y. V.X «a oiu oiu oxv - 

595 

«a «a «P TV. - - »• - - S 
.e. Oly «P 3« L.U val ..e «| OXu olv '.V ^ 

625 "° 



01. Th. ^a «. H.t OIJ - - S» 

P.„ H« Z ^ "° 

660 

. c ,.ia His Arg Leu Ser Gin Qlu Leu Gly He Asp Tyr 
Tyr Gly Met Ser Ala His Arg ^eu 
675 

01. «a 01. Ma P.e II, 31. ^ Ty. ... 01. Se. P« L,. 

690 

val ^ Ma T„ XXa ol. ^ T.. 01. olu oly «9 Thr OJy 

val 01. Tto L.. Pha Oly xr, Ty. val Pro A=P La. Ala 

725 

sar val ^ Sa, val 01. Ma Ma 01. Mat Ma P.. 

P„ val II Oly ^ Ma M. «P .y= "J M. »at ,.1 

755 

riv Ala His Leu Leu Leu Gin 
Phe pro Arg Leu Lys Pro Leu Gly Ala his 



Lys Leu PHe Pro Atg 78O 
770 

VM Ma «. 01. val 01. Val Pro 01. «P 

785 

.y. Ma Le. val .ya 01. val Hat 01. M. Tyr Pro La. «P 

805 

V.1 pro ... 01. val 01. val oly v.l Oly "P 



820 



Lys Gin Asp 
835 

<210> 360 
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<211> 42 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 360 

gggcttcccg ccatcaagaa gacgaagaag acgggcaagc gc 42 



<210> 361 

<211> 42 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 361 

gcgcttgccc gtcttcttcg tcttcttgat ggcgggaagc cc 42 



<210> 362 

<211> 33 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 362 

atgcttttgc aggtggccaa cgaactggtc etc. 33 



<210> 363 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 363 

gaggaccagt tcgttggcca cctgcaaaag cat 33 



<210> 364 

<211> 2496 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 364 

atgaattccc tgcccctctt tgagcccaag ggccgggtgc ttctggtgga cggccaccac 60 



138 



aocctcacca ccagccgcgg ggagccggtc 12° 

— ^^.^ - 

„.c,,.. .c^'- — « ^.,,o«.c 

,^.,.c.c 3« 

«^..c= — ^^^^'.^ L»=.C - 

..cc„,.. „ 

»-ttacoagot tctttcggag cgaatctcca toct 

^0«..CC ™ ^^^^^^^^^ 

t.cc99g«t MSCosg^ga cccttccs ^ ^^^^^^^ „ 

„ c<=«,qctgat ocgggaatgg g9a««'99 
g.„.gg.a ..cc.=o« ...g^ag a«^ , 
ag™ g..=.- ,,,^,,„,. . 

cgccgggago cgg.ccggg. ' , 

V r.i-t-aaaaagc ccggtggcgg cggaggaag 
agttcggcct g.tggaa . ^^^^^^^^^^ ^^^^^^^^^^ 

— --"^^^ cg...acc gggcggagga ~g ^ 

— tggccgccgc c.ggggc. ^^^^^^^^^^ 3 

gccttgcggg ggcttgggga ggtgaggg99 ^^^^^^^^ ^ 

aaattgccct ggcaccgggc gacgacccca tgctcctcgc 
ctgagggaag ggattgcc ^^^^^^^^^^ ^^^^^^^^.^ 

,„a acaccgcccc cgaaggggta gcc gg^g^^ ^^^^^^^^^^ ^^^^^^^^^^ 
gaggcggggg aaagggcgct gotttccgaa agg= ^^^^^^^^^^ 

^nttct ttggctttac gaggaggtgg aaaag 
aagggggagg • J ,,^^,,^,3 cctac«aaa ggccctttcc 

ctggcccaca tggaggccac gggggtacgg ^^^^^^^ 
„«,aat aaggcgcttc gaggaggagg tccaccg 

'^'"t: - 

™" Z 

cacacccgtt tcaaccagac ggc 
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ctgcaaaata 


tccccgtgcg 


cacccctttg 


ggccagcgga 


tccgccgggc 


cttcgtggcc 


1800 


gaggaggggt 


ggaggctggt 


ggttttggac 


tacagccaga 


ttgagctcag 


ggtcctggcg 


1860 


cacctttccg 


gggacgagaa 


cctaatccgg 


gtcttccagg 


agggccagga 


catccacacc 


1920 


cagacggcca 


gctggatgtt 


cggcgtgccc 


ccagaggccg 


tggattccct 


gatgcgccgg 


1980 


gcggccaaga 


ccatcaactt 


cggcgtcctc 


tacggcatgt 


ccgcccaccg 


gctttcggga 


2040 


gagctggcca 


tcccctacga 


ggaggcggtg 


gccttcatcg 


agcggtattt 


ccagagctac 


2100 


cccaaggtgc 


gggcctggat 


tgagaaaacc 


ctggcggaag 


gacgggaacg 


gggctatgtg 


2160 


gaaaccctct 


ttggccgccg 


gcgctacgtg 


cccgacttgg 


cttcccgggt 


gaagagcatc 


2220 


ccrcfcraGracaQ 


ccraacrcacat 


cracc^kcaac 


ataccaatcc 


aaaociaccac 




2280 


atgaaactgg 


ccatggtgaa 


gctctttccc 


aggcttcagg 


agctgggggc 


caggatgctt 


2340 


ttgcaggtgc 


acaacgaact 


ggtcctcgag 


gctcccaagg 


agcaagcgga 


ggaagtcgcc 


2400 


caggaggcca 


agcggaccat 


ggaggaggtg 


tggcccctga 


aggtgccctt 


ggaggtggaa 


2460 


gtgggcatcg 


gggaggactg 


gctttccgcc 


aaggcc 






2496 



<210> 365 

<211> 832 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 365 

Met Asn Ser Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
15 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
20 25 30 

Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val lie Val Val 
50 55 60 

Phe Asp Ala Lys Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu lie Lys Glu Met Val Asp Leu Leu Gly Leu Glu Arg Leu Glu Val 
100 105 110 
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^ ^r^^ Teu Ala Thr Leu Ala Lys Lys Ala 
pro Gly Phe Glu Ala Asp Asp Val Leu Ala xnr 

115 " 



lie Leu Thr Ala Asp Arg Asp Leu 

Giu ^J-y — ^ 

■lie 

130 



Glu Lys Glu Gly Tyr Glu Val Arg lie .eu ~- 



^ 01. .eu .eu Ser Glu He Ser Xle .eu His Pro Olu Gly Tyr 



145 



.eu lie pro Glu Trp .eu Trp Glu Lye Tyr Gly .eu .ys Pro Ser 



pro Gly val Lys Gly He Gly Glu Lys xnr ^a ^» 
195 -^"^ 



165 "0 

, X niv AsD Pro Ser Asp Asn He 

Gin Trp val Asp Tyr Arg Ala Leu Ala Gly Asp 
180 

Thr Ala Ala Lys Leu He Arg 
205 

T„o w^s Leu Glu Gin Val Lys 
Glu Trp Gly ser Leu Glu Asn Leu Leu Lys Hxs Leu 

210 

s.. v,x ^ oxu .ys ... - Hi. Het OIU »P - ^v. 

225 

^„ 2« '.^ s r« 
«. ^ - sr. "° 

Leu His Olu Phe Gly Leu Leu 

280 



Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu «xs ^^^^^^ 
275 



p„ val -a. «. 31U alu M. P~ T.P Pjo P.O pro Olu =1V 



Glu Ser Pro vax aj.* "■^•^ 300 

290 295 
.1. P^ val OIV ^ val pro .lu Pro ».t Trp Xla olj 

305 

^v ^ .la .e. «a Ma M. TrP ^ ?S 

325 ^-^^ 

p„ Olu «a L» Olv ^« OIV 01» V.1 01, 
^, "I V.1 -u - - - - - 

355 



p„ .1. «p «P -o Ma. ..u .1. Tvr - - ^ 

370 

^ pro 01. Oly val ;a. «3 «9 Tyr Oly Oly olu Trp Thr Olu 

Z Oly OIU Z -u sar 01. - ^' ^ 
405 
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Leu Lys Arg Leu Lys Gly Glu Olu Arg Leu Leu Trp Leu Tyr Glu Glu 
420 425 430 

Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr Gly 
435 440 445 

Val Arg Leu Asp Val Ala Tyr Leu Lys Ala Leu Ser Leu Glu Val Glu 
450 455 460 

Ala Glu He Arg Arg Phe Glu Glu Glu Val His Arg Leu Ala Gly His 
465 470 475 480 

Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val He Phe Asp 
4B5 490 495 

Glu Leu Gly Leu Pro Ala He Lys Lys Thr Arg Lys Thr Gly Lys Arg 
500 505 510 

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro He 
515 520 525 

Val Asp Arg He Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys Gly Thr 
530 535 540 

Tyr He Asp Pro Leu Pro Ala Leu Val His Pro Lys Thr Asn Arg Leu 
545 550 555 560 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 570 575 

Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin 
580 585 590 

Arg He Arg Arg Ala Phe Val Ala Glu Glu Gly Trp Arg Leu Val Val 
595 600 605 

Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 615 620 

Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Gin Asp He His Thr 
625 630 635 640 

Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp Ser 
645 650 655 

Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr Gly 
660 665 670 

Met Ser Ala His Arg Leu Ser Gly Glu Leu Ala He Pro Tyr Glu Glu 
675 680 685 

Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Tyr Pro Lys Val Arg 
690 695 700 



Ala Trp He Glu Lys Thr Leu Ala Glu Gly Arg Glu Arg Gly Tyr Val 
705 710 715 720 
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rv « THTY- val Pro Asp Leu Ala Ser Arg 
Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro a p 

725 

val Lys ser Ue Arg Olu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 
740 

V.1 01. aiy «« "a "P Tes ""^ 

755 

Phe pro Arg X.eu Gin Glu .eu Gly Ala Arg Met .eu Leu Gin Val His 
770 '^^^ 

Cl» val LOU 01. P~ 01. Oln >a. ol. Olu Val «a 
785 ''5° 

01„ 01« Ala .ys T>„ H.t Olu Olu Val Trp Pro Lau Lys Val Pro 

805 

^„ Olu val Olu V.1 Oly U. Oly olu .r, La» Sax Jla Ly. Ma 

820 



<210> 366 

<211> 2505 

<212> DNA 

<213> Artificial 

<220> 

<223> synthetic 



:;^^«tcS c=cc.ct«t t,acc.g,., gaacccccc. .gC999«=t tot«t<,9ac 
^coaccacc ts.cct.co. c.=«to..t ,=.=t.a,cc tcaccacctc ccg^^a, 
cc^«ca,a t^tctac^ ottc^cc. ..cctcctc. a^ccttgaa ,,a3,aC3. 
cag^cg^ta, tc^^ctt tgao^caa, .cc.cctcct toc.cc.o.a ..cctao,., 
,cotacaa« =»9CC999= ccccacccc, ,.99.c«== 

aa,cg,«^ t^aocttct ^.«t.cc csoctc,as3 ccco.,^t. c..,,c^ao 
^cgtcct,, 9=acc=t^. caagaa^gcc ,a.a,8gagg g^atgg.ggt sc90.toctc 
ac,a...acc 3g.ac«ctt ocaectcctc to=,a,aa« tctc^'cct c=«=c,.ac 
,,,.coct., toaceccaaa .,actc=a, 3.9a.9«c, .SS«ccccc »agc,ct^ 
.tssacttoc 5c,=0=tc.= 9,59ga=c,o tco^aca.oa tcocc9,S9t ,9C,gg3ata 
^a,a.,a oc.occttc, .ctccco. g.,t,,,39. 

aacctggacc ^3t«.a3~ ^""."c o,,c,c..ga ..^aggcca cct=,a«ac 
ctocacotct cct^aaacct ^gcccsc.tc c^caccgaoc tccccot^sa 9St9..cttt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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aaggccctgc 


gccgcaggac 


ccccgacctg 


gagggcctga 


gggccttttt ggaggagctg 


840 


gagttcggaa 


gcctcctcca 


cgagttcggc 


ctcctgggag 


gggagaagcc ccgggaggag 


900 


gccccctggc 


ccccgcccga 


aggggccttc 


gtgggcttcc 


tcctttcccg caaggagccc 


960 


atgtgggcgg 


agcttctggc 


cctggcggcg 


gcctcgggcg 


gccgcgtcca ccgggcaaca 


1020 


agcccggttg 


aggccctggc 


cgacctcaag 


gaggcccggg 


ggttcctggc caaggacctg 


1080 


gccgttttgg 


ccctgcggga 


gggggtggcc 


ctggacccca 


cggacgaccc cctcctggtg 


1140 


gcctacctcc 


tggacccggc 


caacacccac 


cccgaggggg 


tggcccggcg ctacgggggc 


1200 


gagttcacgg 


aggacgcagc 


ggagagggcc 


ctcctctccg 


agaggctctt ccagaacctc 


1260 


tttaaacggc 


tttccgagaa 


gctcctctgg 


ctctaccagg 


aggtggagcg gcccctctcc 


1320 


cgggtcttgg 


cccacatgga 


ggcccggggg 


gtgaggctgg 


acgtccccct tctggaggcc 


1380 


ctctcctttg 


agctggagaa 


ggagatggag 


cgcctggagg 


gggaggtctt ccgtttggcc 


1440 


ggccacccct 


tcaacctcaa 


ctcccgcgac 


cagctggaaa 


gggtcctctt tgacgagctg 


1500 


ggcctcaccc 


cggtgaagcg 


gacgaagaag 


acgggcaagc 


gctccaccgc ccagggggcc 


1560 


ctggaggccc 


tccggggggc 


ccaccccatc 


gtggagctca 


tcctccagta ccgggagctt 


1620 


tccaagctca 


aaagcaccta 


cctggacccc 


ctgccccggc 


tcgtccaccc gcggacgggc 


1680 


cggctccaca 


cccgcttcaa 


ccagacggcc 


acggccacgg 


gaaggctttc cagctccgac 


1740 


cccaacctgc 


agaacatccc 


cgtgcgcacc 


cccttggggc 


agcgcatccg caaggccttc 


1800 


gtggccgagg 


aggggtggct 


ccttttggcg 


gcggactact 


cccagattga gctccgggtc 


1860 


ctggcccacc 


tctcggggga 


cgagaacctg 


aagcgggtct 


tccgggaggg gaaggacatc 


1920 


cataccgaga 


ccgccgcctg 


gatgttcggc 


ttagaccccg 


ctctggtgga tccaaagatg 


1980 


cgccgggcgg 


ccaagacggt 


caacttcggc 


gtcctctacg 


ggatgtccgc ccacaggctc 


2040 


tcccaggagc 


tcggcataga 


ctacaaggag 


gcggaggcct 


ttattgagcg ctacttccag 


2100 


agcttcccca 


aggtgcgggc 


ctggatagaa 


aggaccctgg 


aggagggccg gacgcggggc 


2160 


tacgtggaga 


ccctgttcgg 


caggaggcgc 


tatgtgcccg 


acctggcctc ccgggtccgc 


2220 


i-^yy uyL-yyy 


"■yycggcgya 


gcggax-ggcc 


uuv-aaca ugc 


ccgugcaggg caccgccgcc 


Q A 

Z4S O V 


gacctgatga 


agatcgccat 


ggtcaagctc 


ttccccaggc 


taaagcccct gggggcccac 


2340 


ctcctcctcc 


aagtggccaa 


cgagctggtc 


ctggaggtgc 


ccgaggaccg ggccgaggag 


2400 


gccaaggccc 


tggtcaagga 


ggtcatggag 


aacgcctacc 


ccctggacgt gcccctcgag 


2460 


gtggaggtgg 


gcgtgggtcg 


ggactggctg 


gaggcgaagc 


aggat 


2505 
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<210> 367 

<211> 835 

<212> PRT 

<213> Artificial 



<220> 

<223> Synthetic 

rji ^.o. --^-'^''-^ 

^ Tvr Arq Thr Phe Tyr Ala Leu 

Leu val ASP Gly His His Leu Ala Tyr Arg 

1 ^ ^ - - - - - - 

^* 70 
100 

T riv Thr Leu Ala Lys 

pro 01, T,. 01« »P «P «1 

115 

01. 0. oiv 0. n. oiv «p ^ 

130 

0. s» ox. V.X s« v,X - - 

:;^..v.x^..o..«pv.xox;ox..v.^ox,v.x.» 
ox. ^ v" - - - - ^» SI 

180 

„o ox, V.X ox, XX. ox, ox. - - - 

1x1 ^ ox, v.x ox. ^ - - X.. "P - 

..x Z «o - - - - s= - 

225 
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Leu His lieu Ser Leu Asp Leu Ala Arg lie Arg Thr Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Phe Lys Ala Leu Arg Arg Arg Thr Pro Asp Leu Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Glu Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu Ser Arg Lys Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Leu Ala Leu Ala Ala Ala Ser Gly Gly Arg Val 
325 330 335 

His Arg Ala Thr Ser Pro Val Glu Ala Leu Ala Asp Leu Lys Glu Ala 
340 345 350 

Arg Gly Phe Leu Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly 
355 360 365 

Val Ala Leu Asp Pro Thr Asp Asp Pro Leu Leu Val Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ala Asn Thr His Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Phe Thr Glu Asp Ala Ala Glu Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

Phe Gin Asn Leu Phe Lys Arg Leu Ser Glu Lys Leu Leu Trp Leu Tyr 
420 425 430 

Gin Glu Val Glu Arg Pro Leu Ser Arg Val Leu Ala His Met Glu Ala 
435 440 445 

Arg Gly Val Arg Leu Asp Val Pro Leu Leu Glu Ala Leu Ser Phe Glu 
450 '455 460 

Leu Glu Lys Glu Met Glu Arg Leu Glu Gly Glu Val Phe Arg Leu Ala 
465 470 475 480 

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 
485 490 495 

Phe Asp Glu Leu Gly .Leu Thr Pro Val Lys Arg Thr Lys Lys Thr Gly 
500 505 510 

Lys Arg Ser Thr Ala Gin Gly Ala Leu Glu Ala Leu Arg Gly Ala His 
515 520 525. 

Pro lie Val Glu Leu He Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys 
530 535 540 
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™, «P - - - - III 

545 "0 



^. HU - - - - 

565 ^' 

n ^ TTo Val Arg Thr Pro beu 
7.o« Pro Asn Leu Gin Asn He Pro vax 
er Ser Ser Asp Pro Asn ucu 59Q 

580 

.1. ph^ val Ala Glu Glu Gly Trp Leu Leu 
ay Gin Arg He Arg Lys Ala Phe Val Ala 

595 

„a «. «P - S - - 

610 

v=>i Phe Arq Glu Gly I»yB Asp lie 
ser Gly Asp Glu Asn Leu Lys Arg Val Phe Arg 

625 ^ 

H,. 0.. T.. «a «a ».t OXV ^ «P - - "a. 

645 

^ ^ «.,«a va. Olv val ^ 

660 

^ ox, «.t s« -a .e« 0.„ - ^ 

OXU I OXU «. n. O.U - - - - ^" 

val «. XU 01„ - al« 01. Oly ™ ^ f,i! 

705 '^^^ 

^ ,al alu P*. «y «^ S ^ "° 

725 

sa. val ^ se. Val CI. Ma «a .lu «« ... - 

740 

val 01. 01, ^ - - S^s 

^ I Pr= - - «^ »• 

770 

val ^. Olu Lau val ... 01. val «o o- «P ^5 

r. .V. «. - -J ''•V' - ^' - 

val «0 L.. 01. vll 01. val 01, val 01, ^ «P - 

Lys Gin Asp 
835 
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<210> 368 
<211> 2496 

<212> DNA 

<213> Artificial 






<220> 

<223> Synthetic 






^ A nn^ "SCO 

atgaattccc 


tgcccctctt 


tgagcccaag ggccgggtgc ttctggtgga cggccaccac 


60 


ctggcctacc 


gtaccttttt 


tgccctgaag ggcctcacca ccagccgcgg ggagccggtc 


120 


caggcggtgt 


acgggtttgc 


caagagcctt ttgaaggcgc taagggaaga cggggatgtg 


180 


gtgatcgtgg 


tgtttgacgc 


caaggccccc tccttccgcc accagaccta cgaggcctac 


240 


aaggcggggc 


gggctcccac 


ccccgaggac tttccccggc agcttgccct tatcaaggag 


300 


atggtggacc 


ttttgggctt 


tacccgcctc gaggtgccgg gctttgaagc ggatgacgtc 


360 


ctggctaccc 


tggccaagaa 


ggcggaaaag gaaggctacg aagtgcgcat cctcaccgcg 


420 


gaccgggacc 


tttaccagct 


tctttcggag cgaatctcca tccttcaccc ggagggttac 


480 


ctgatcaccc 


cggagtggct 


ttgggagaag tatgggctta agccttccca gtgggtggac 


540 


taccgggcct 


tggccgggga 


cccttccgac aacatccccg gcgtgaaggg catcggggag 


600 


aagacggcgg 


ccaagctgat 


ccgggagtgg ggaagcctgg aaaaccttct taagcacctg 


660 


gaacaggtga 


aacctgcctc 


cgtgcgggag aagatcctta gccacatgga ggacctcaag 


720 


ctatccctgg 


agctatcccg 


ggtgcacacg gacttgctcc ttcaggtgga cttcgcccgg 


780 


cgccgggagc 


cggaccggga 


ggggcttaag gcctttttgg agaggctgga gttcggaagc 


840 


ctcctccacg 


agttcggcct 


gttggaaagc ccggtggcgg cggaggaagc tccctggccg 


900 


ccccccgagg 


gagccttcgt 


ggggtacgtt ctttcccgcc ccgagcccat gtgggcggag 


960 


cttaacgcct 


tggccgccgc 


ctggggcggc cgcgtttacc gggcggagga tcccttggag 


1020 


gccttgcggg 


ggcttgggga 


ggtgaggggg cttttggcca aggacctggc ggtgctggcc 


1080 


ctgagggaag 


ggattgccct 


ggcaccgggc gacgacccca tgctcctcgc ctacctcctg 


1140 


gatccttcca 


acaccgcccc 


cgaaggggta gcccggcgct acggggggga gtggaccgag 


1200 


gaggcggggg 


aaagggcgct 


gctttccgaa aggctttacg ccgccctcct gaagcggctt 


1260 


aagggggagg 


agaggcttct 


ttggctttac gaggaggtgg aaaagcccct ttcgcgggtc 


1320 


ctggcccaca 


tggaggccac 


99gggtacgg ttggatgtgg cctacttaaa ggccctttcc 


1380 


ctggaggtgg 


aggcggagat 


aaggcgcttc gaggaggagg tccaccgcct ggccgggcat 


1440 
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cctttcaacc tgaactcccg ggaccagctg gaaagggtca tctttgacga gcttgggctt 1500 

cccgccatca agaagacgag gaagacgggc aagcgctcca ccagcgccgc cgttttggag 1560 

gccttgcggg aggctcatcc catcgtggac cgcatccttc agtaccggga gctttccaag 1620 

ctcaagggaa cctacatcga tcccttgcct gccctggtcc accccaagac gaaccgcctc 1680 

cacacccgtt tcaaccagac ggccaccgcc acggggaggc ttagcagctc ggatcctaat 1740 

ctgcaaaata tccccgtgcg cacccctttg ggccagcgga tccgccgggc cttcgtggcc 1800 

gaggaggggt ggaggctggt ggttttggac tacagccaga ttgagctcag ggtcctggcg 1860 

cacctttccg gggacgagaa cctaatccgg gtcttccagg agggccagga catccacacc 1920 

cagacggcca gctggatgtt cggcgtgccc ccagaggccg tggattccct gatgcgccgg 1980 

gcggccaaga ccatcaactt cggcgtcctc tacggcatgt ccgcccaccg gctttcggga 2040 

gagctggcca tcccctacga ggaggcggtg gccttcatcg agcggtattt ccagagctac 2100 

cccaaggtgc gggcctggat tgagaaaacc ctggcggaag gacgggaacg gggctatgtg 2160 

gaaaccctct ttggccgccg gcgctacgtg cccgacttgg cttcccgggt gaagagcatc 2220 

cgggaggcag cggagcgcat ggccttcaac atgccggtcc aggggaccgc cgcggatttg 2280 

atgaaactgg ccatggtgaa gctctttccc aggcttcagg agctgggggc caggatgctt 2340 

ttgcaggtgc acaacgaact ggtcctcgag gctcccaagg agcaagcgga ggaagtcgcc 24 00 

caggaggcca agcggaccat ggaggaggtg tggcccctga aggtgccctt ggaggtggaa 2460 
gtgggcatcg gggaggactg gctttccgcc aaggcc 



<210> 369 

<211> 832 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 369 

Met Asn Ser Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
15 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
20 25 30 

Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val lie Val Val 



2496 
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50 



55 



60 



Phe Asp Ala Lys Ala Pro Ser Phe Arg His Gin Thr Tyr Qlu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu lie Lys Glu Met Val Asp Leu Leu Gly Phe Thr Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Arg Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Glu Arg lie Ser lie Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu Lys Pro Ser 
165 170 175 

Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser Asp Asn lie 
180 185 190 

Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Ala Lys Leu lie Arg 
195 200 205 

Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys His Leu Glu Gin Val Lys 
210 .215 220 

Pro Ala Ser Val Arg Glu Lys lie Leu Ser His Met Glu Asp Leu Lys 
225 230 235 240 

Leu Ser Leu Glu Leu Ser Arg Val His Thr Asp Leu Leu Leu Gin Val 
245 250 255 

Asp Phe Ala Arg Arg Arg Glu Pro Asp Arg Glu Gly Leu Lys Ala Phe 
260 265 270 

Leu Glu TVrg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu 
275 280 285 

Glu Ser Pro Val Ala Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 
290 295 300 

Ala Phe Val Gly Tyr Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu 
305 310 315 320 

Leu Asn Ala Leu Ala Ala Ala Trp Gly Gly Arg Val Tyr Arg Ala Glu 
325 330 335 

Asp Pro Leu Glu Ala Leu Arg Gly Leu Gly Glu Val Arg Gly Leu Leu 
340 345 350 

Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly lie Ala Leu Ala 
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355 360 365 

Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 
370 375 380 

Thr Ala Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu 
385 390 395 400 

Glu Ala Gly Glu Arg Ala Leu Leu Ser Glu Arg Leu Ty^ Ala Ala Leu 
405 410 415 

Leu Lys Arg Leu Lys Gly Glu Glu Arg Leu Leu Trp Leu Tyr Glu Glu 
420 425 430 

Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr Gly 
435 440 445 

Val Arg Leu Asp Val Ala Tyr Leu Lys Ala Leu Ser Leu Glu Val Glu 
450 455 460 

Ala Glu lie Arg Arg Phe Glu Glu Glu Val His Arg Leu Ala Gly His 
465 470 475 480 

Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val lie Phe Asp 
485 490 495 

Glu Leu Gly Leu Pro Ala lie Lys Lys Thr Arg Lys Thr Gly Lys Arg 
500 505 510 

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro He 
515 520 525 

Val Asp Arg He Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys Gly Thr 
530 535 540 

Tyr He Asp Pro Leu Pro Ala Leu Val His Pro Lys Thr Asn Arg Leu 
545 550 555 560 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 570 575 

Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin 
560 585 590 

Arg He Arg Arg Ala Phe Val Ala Glu Glu Gly Trp Arg Leu Val Val 
595 600 605 

Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 615 620 

Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Gin Asp He His Thr 
625 630 635 640 

Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp Ser 

645 650 655 

Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr Gly 
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660 665 670 

Met Ser Ala His Arg Leu Ser Gly Glu Leu Ala He Pro Tyr Glu Glu 

675 680 685 

Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Tyr Pro Lys Val Arg 
690 695 700 

Ala Trp He Glu Lys Thr Leu Ala Glu Gly Arg Glu Arg Gly Tyr Val 
7.05 710 715 720 

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Ala Ser Arg 
725 730 735 

Val Lys Ser He Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 
740 745 750 

Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu 
755 760 765 

Phe Pro Arg Leu Gin Glu Leu Gly Ala Arg Met Leu Leu Gin Val His 
770 775 780 

Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Gin Ala Glu Glu Val Ala 
785 790 795 800 

Gin Glu Ala Lys Arg Thr Met Glu Glu Val Trp Pro Leu Lys Val Pro 
805 810 815 

Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys Ala 
820 825 830 



<210> 


370 


<211> 


33 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


370 



ccctccgaca acctcgccgg ggtcaagggc ate 33 



<210> 


371 


<211> 


33 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


371 



ccctccgaca acctcaaggg ggtcaagggc ate 33 



<210> 372 
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<211> 18 

<212> DNA 

<213> TVrtificial 

<220> 

<223> Synthetic 

<400> 372 
gaggttgtcg gaggggtc 



<210> 373 
<211> 2526 
<212> DNA 
<213> Artificial 






<220> 

<223> Synthetic 






<400> 373 
atgaattccg 


ctggcgatgct 


tccgctcttt gaacccaaag gccgggtcct cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc gccctgaagg gcctcaccac gagccggggc 


1 on 


gaaccggtgc 


aggcggtcta 


cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 




y y y i,.&w<>»«y y 


cccrtcttcot 


ggtctttgac gccaaggccc cctccttccg ccacgaggcc 


240 


tacaaacTCct 


&w»»y y ^y y y 


gagggccccg acccccgagg acttcccccg gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg tttacccgcc tcgaggtccc cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg ctttgggaga agtacggcct caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg gacccctccg acaacctcgc cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa aacgtccggg agaagatcaa ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg gaggggctta gggccttcct ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc ctcctggagg cccccgcccc cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc gtgggcttcg tcctctcccg ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc tgcaggggcg gccgcgtgca ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag gaggtccggg gcctcctcgc caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac ctcgtgcccg gggacgaccc catgctcctc 


1140 



I 
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gcctacctcc 


tggacccttc 


gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc ctttggctct accacgaggt ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc accggggtac ggcgggacgt ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc cgggaccagc tggaaagggt gctctttgac 


1500 


gagcttaggc 


ttcccgcctt 


gaagaagacg aagaagacag gcaagcgctc caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg gaccccctcc caagcctcgt ccacccgagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag acggccacgg ccacggggag gcttagtagc 


1740 


tccgacccca 


acctgcagaa 


catccccgtc cgcaccccct tgggccagag gatccgccgg 


1800 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg gtggccctgg actatagcca gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


cggggacgaa aacctgatca gggtcttcca ggaggggaag 


1920 


gacatccaca 


cccagaccgc 


aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 


1980 


ctgatgcgcc 


gggcggccaa 


gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 


2040 


aggctctccc 


aggagcttgc 


catcccctac gaggaggcgg tggcctttat agagcgctac 


2100 


btccaaagct 


tccccaaggt 


gcgggcctgg atagaaaaga ccctggagga ggggaggaag 


2160 


cggggctacg 


tggaaaccct 


cttcggaaga aggcgctacg tgcccgacct caacgcccgg 


2220 


gtgaagagcg 


tcagggaggc 


cgcggagcgc atggccttca acatgcccgt ccagggcacc 


2280 


gccgccgacc 


tcatgaagct 


cgccatggtg aagctcttcc cccgcctccg ggagatgggg 


2340 


gcccgcatgc 


tcctccaggt 


cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 


2400 


gaggaggtgg 


cggctttggc 


caaggaggcc atggagaagg cctatcccct cgccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


gggggaggac tggctttccg ccaagggtca ccaccaccac 


2520 


caccac 






2526 



<210> 374 

<211> 842 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
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<400> 374 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
IS 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 

20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 BO 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 

85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Tip Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Ala Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 
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Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 

450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 

565 570 575 



Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 
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pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

ASP He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arq Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 

840 





835 


<210> 


375 


<211> 


2526 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


375 
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akgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctcaa 


gggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgtgca 


ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 








\«yyy d^voayc 




o 4" rra 

y wuCuc^ycLC 




gagcttaggc 


ttcccgcctt 


gaagaagacg 


aagaagacag 


gcaagcgctc 


caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag 


gcttagtagc 


1740 
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<220> 

<223> Synthetic 
<400> 198 

ccgtcacgcc tcctcctcat tgaatt 



<210> 199 

<211> 35 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 199 

ccaaaagtcc agtgatgatt ttcaccaggc aagta 



<210> 200 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 200 

•cagattggaa gcatccatct 



<210> 201 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<40p> 201 

gattcaatga ggaggaggc 



<210> 202 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 202 

ccgtcacgcc tccatctgtt tagg 



<210> 203 
<211> 22 
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<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 203 

caggtcctgg aaggagcact ta 22 



<210> 204 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 204 

gccatcagct tctttgttct tgtcatc 27 



<210> 205 

<211> 20 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 

<400> 205 

gccctaaaca gatggaggcg 20 



<210> 206 

<211> 24 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 206 

ccgtcacgcc tcctccagtt gtag 24 



<210> 207 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 207 

aaaatcatct gtaaatccag cagtaaatga 30 
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<210> 208 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 208 

ctgtgttttc tttgtagaac 



<210> 209 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 209 
ctacaactgg aggaggc 



<210> 210 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 210 

ccgtcacgcc tcctctcagt tct 



<210> 211 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 211 

gtgtggtcca ctctcaatca a 



<210> 212 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
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<220> 

<221> modified_base 
<222> (1)..(1) 

<223> The residue at tH position contains a TET-label. 
<400> 212 

attagaaagg aagggaagaa agcgaa 26 



<210> 213 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 213 

gcttgacggg gaaagccggc gaacgtggcg 30 



<210> 214 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 214 

cttgacgggg aaagccggcg aacgtggcga 30 



<210> 215 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 215 

tgacggggaa agccggcgaa cgtggcgaga 30 



<210> 216 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 216 

acggggaaag ccggcgaacg tggcgagaaa 30 
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<210> 217 

<21X> 53 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 217 

ttcgctttct tcccttcctt tctcgccacg ttcgccggct ttccccgtca age 53 



<210> 218 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<220> 

<221> modif ied_base 

<222> (1) . . (1) 

<223> The residue at tH positions contains a fluoroscein label. 



<400> 218 

tttccctcct cctcttcc 18 



<210> 219 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 219 

acacagtgtc ctcccgctcc tcctgagcaa 30 



<210> 220 

<211> 54 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 220 

atgaggaaga ggaggagggt gctcaggagg agcgggagga cactgtgtct gtca 54 

<210> 221 

<211> 840 
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<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 221 

Met Asn Ser Qlu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Ala Val 
50 55 60 

lie Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala Tyr 
65 70 75 80 

Gly Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg 
85 90 95 

Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg 
100 105 110 

Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu Ala 
115 120 125 

Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp 
130 135 140 

Lys Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro 
145. 150 155 160 

Glu Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu 
165 170 175 

Arg Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser 
180 185 190 

Asp Asn Leu Pro Gly Val Lys GJ.y lie Gly Glu Lys Thr Ala Leu Lys 
195 200 205 

Leu Leu Lys Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp 
210 215 220 

Arg Leu Lys Pro Ala lie Arg Glu Lys He Leu Ala His Met Asp Asp 
225 230 235 240 

Leu Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Phe Ala Lys Arg Arg Qlu Pro Asp Arg Glu Gly Leu Lys 



164 



wo 01/90337 



PCT/USOl/17086 



260 



265 



270 



Ala Phe Leu Glu Arg Leu Glu Phe Qly Ser Leu Leu His Glu Phe Gly 
275 280 285 

Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro Pro Pro 
290 295 300 

Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp 
305 310 315 320 

Ala Asp Leu Leu Ala Leu Ala Ala Cys Arg Gly Gly Arg Val His Arg 
325 330 335 

Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lya Glu Val Arg Gly 
340 345 350 

Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp 
355 360 365 

Leu Val Pro Qly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro 
370 375 380 

Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp 
385 390 395 400 

Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg 
405 410 415 

Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr 
420 425 430 

His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala 
435 440 445 

Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu 
450 455 460 

Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala 
465 470 475 480 

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 
485 490 495 

Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly 
500 505 510 

Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His 
515 520 525 



Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys 
530 535 540 

Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly 

545 550 555 560 

Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu 
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565 570 575 

Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu 
580 585 590 

Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu 
595 600 605 

Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu 
610 615 620 

Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys Asp lie 
.625 630 635 640 

His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val 
645 650 655 

Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu 
660 665 670 

Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr 
675 680 685 

Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys 

690 695 700 

Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly 
705 710 715 720 

Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn 
725 730 735 

Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
740 745 750 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val 
755 760 765 

Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin 
770 775 780 

Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu 
785 790 795 800 

Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala 

805 810 815 

Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala 
820 825 830 

Lys Gly His His His His His His 
835 840 

<210> 222 

<21l5» 2520 

<212> DNA 

<213> Artificial 
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<220> 

<223> Synthetic 










<400> 222 
atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag gccggguwct 




60 


ggccaccacc 


tggcctaccg 


cacccuuuuu 






120 


gagccggtcc 


aggcggtgta 


cgggtuugcc 




a a a A a a a.cra c 

day GLy ci&y 


180 


ggggacgcgg 


tgatcgtggt 


Cuuugacgcc 




r*cfaaacctac 


240 


ggggggtaca 


aggcggggcg 


g ^ O u V- w lo' d w w 




acttaccctt 


300 


atcaaggagc 


tggtggacct 


cccggggccc 


acccgcctcg ayywcwi*v.#yy 


*-» w a *»y dy y wy 


360 


gacgacgttc 


tcgccaccct 


ggccaagaag 


gcggaaaagg aggggtacga 


gguycycdtu 


420 


ctcaccgccg 


acaaagacct 


ttaccagctc 


ctttccgacc gcatccacgt 


cctccacccc 


Ann 


gaggggtacc 


tcatcacccc 


ggcctggctt 


tgggaaaagt acggcctgag 


gcccgaccag 




tgggccgact 


accgggccct 


gaccggggac 


gagtccgaca accttcccgg 


ggccaagggc 




atcggggaga 


agaccgccct 


caagctcctc 


aaggagtggg ggagcctgga 


agccctcctc 


boU 


aagaacctgg 


accggctgaa 


gcccgccatc 


cgggagaaga tcctggccca 


cacggacgau 


no ft 


ctgaagctct 


cctgggacct 


ggccaaggtg 


cgcaccgacc tgcccctgga 


ggtggacttc 


/ ou 


gccaaaaggc 


gggagcccga 


ccgggagggg 


cttaaggcct ttttggagag 


gccggagcuc 


R A ft 


ggcagcctcc 


tccacgagtt 


cggcctcctg 


ggaggggaga agccccggga 


ggaggccccc 


Qftft 


tggcccccgc 


cggaaggggc 


cttcgtgggc 


tttgtgcttt cccgcaagga 


gcccatgtgg 




gccgatcttc 


tggccctggc 


cgcctgcagg 


ggcggccgcg tgcaccgggc 


agcagacccc 


1 ft O ft 


ttggcggggc 


taaaggacct 


caaggaggtc 


cggggcctcc tcgccaagga 


cctcgccgtc 


*1 A Q ft 


ttggcctcga 


gggaggggct 


agacctcgtg 


cccggggacg accccatgct 


cctcgcctac 


•t 1 ii n 
1X40 


ctcctggacc 


cttcgaacac 


cacccccgag 


ggggtggcgc ggcgctacgg 


gggggagtgg 


1200 


acggaggacg 


ccgcccaccg 


ggccctcctc 


tcggagaggc tccatcggaa 


cctccttaag 


l^oO 


cgcctcgagg 


gggaggagaa 


gctcctttgg 


ctctaccacg aggtggaaaa 


gcccctctcc 


1320 


cgggtcctgg 


cccatatgga 


ggccaccggg 


gtacggcggg acgtggccta 


ccttcaggcc 


1380 


ctttccctgg 


agcttgcgga 


ggagatccgc 


cgcctcgagg aggaggtctt 


ccgcttggcg 


1440 


ggccacccct 


tcaacctcaa 


ctcccgggac 


cagctggaaa gggtgctctt 


tgacgagctt 


1500 


aggcttcccg 


ccttgaagaa 


gacgaagaag 


acaggcaagc gctccaccag 


cgccgcggtg 


1560 
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ctggaggccc 


tacgggaggc 


ccaccccatc 


gtggagaaga 


tcctccagca 


ccgggagctc 


1620 


accaagctca 


agaacaccta 


cgtggacccc 


ctcccaagcc 


tcgtccaccc 


gaggacgggc 


1680 


cgcctccaca 


cccgcttcaa 


ccagacggcc 


acggccacgg 


ggaggcttag 


tagctccgac 


1740 


cccaacctgc 


agaacatccc 


cgtccgcacc 


cccttgggcc 


agaggatccg 


ccgggccttc 


1800 


gtggccgagg 


cgggttgggc 


gttggtggcc 


ctggactata 


gccagataga 


gctccgcgtc 


1860 


ctcgcccacc 


tctccgggga 


cgaaaacctg 


atcagggtct 


tccaggaggg 


gaaggacatc 


1920 


cacacccaga 


ccgcaagctg 


gatgttcggc 


gtccccccgg 


aggccgtgga 


ccccctgatg 


1980 


cgccgggcgg 


ccaagacggt 


gaacttcggc 


gtcctctacg 


gcatgtccgc 


ccataggctc 


2040 


tcccaggagc 


ttgccatccc 


ctacgaggag 


gcggtggcct 


ttatagagcg 


ctacttccaa 


2100 


agcttcccca 


aggtgcgggc 


ctggatagaa 


aagaccctgg 


aggaggggag 


gaagcggggc 


2160 


tacgtggaaa 


ccctcttcgg 


aagaaggcgc 


tacgtgcccg 


acctcaacgc 


ccgggtgaag 


2220 




ayy uuy cyga. 


y cyca,uyy cc 


■f- ^ /-< a -J /-» 4- /-f /-I 

t u a a. ^ d u y 


ccy uccayyy 


c ci o cy c^y C L. 


9 5 fin 


gacctcatga 


agctcgccat 


ggtgaagctc 


ttcccccgcc 


tccgggagat 


gggggcccgc 


2340 


atgctcctcc 


aggtcgccaa 


cgagctcctc 


ctggaggccc 


cccaagcgcg 


ggccgaggag 


2400 


gtggcggctt 


tggccaagga 


ggccatggag 


aaggcctatc 


ccctcgccgt 


gcccctggag 


2460 


gtggaggtgg 


ggatggggga 


ggactggctt 


tccgccaagg 


gtcaccacca 


ccaccaccac 


2520 



<210> 223 

<211> 16 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<220> 

<221> n 

<222> (1)..{1) 

<223> TH 5' end has a fluorescein label. 



<220> 

<221> n 

<222> (6) . . (6) 

<223> The residue at tH position is a cy3 abasic linker group. 
<400> 223 

ncgctntctc gctcgc 16 
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<210> 224 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 224 

acggaacgag cgtctttg 



<210> 225 

<211> 32 

<212> RNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 225 

gcgagcgaga cagcgaaaga cgcucguucc gu 



<210> 226 

<211> 32 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 226 

gcgagcgaga cagcgaaaga cgctcgttcc gt 



<210> 227 

<211> 29 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 227 

acggaacgag cgtctttcat ctgtcaatc 



<210> 228 

<211> 26 

<212> RNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 228 
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ucacggcagu uggugcggaa cgcacg - 26 



<210> 229 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 229 

tcacggcagt tggtgcggaa cgcacg 26 



<210> 


230 


<211> 


30 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<220> 




<221> 


n 


<222> 


(30) . . (30) 


<223> 


TH 3' end is modified with 


<400> 


230 


cggaggaagc agttggtgcg cctcgttaan 


<210> 


231 


<211> 


23 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<220> 




<221> 


n 


<222> 


(1)..(1) 


<223> 


TH 5* end is labeled with 


<400> 


231 


ntccttctca actgcttcct ccg 


<210> 


232 


<211> 


28 


<212> 


DNA 


<213> 


Artificial 


<220> 





30 



23 
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<223> Synthetic 

<220> 

<221> n 

<222> (28) . . (28) 

<223> TH 3' end is modified with a biotin moiety. 



<400> 232 

aacgaggcgc acctcaaatc tccctttn 



<210> 233 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<220> 

<221> n 

<222> (1)..(1) 

<223> TH 5* end is labeled with fluorescein. 



<400> 233 

nagcgagaca gcgaaagacg ctcgtt 



<210> 234 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<220> 

<221> n 

<222> (1)..(1) 

<223> TH 5' end is labeled with fluorescein. 



<400> 234 
nttttcgctg tctcgct 



<210> 235 

<211> 13 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 235 
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acgagcgtct ttg 13 



<210> 236 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<:223> Synthetic 

<400> 236 

cacgaattcg gggatgctgc ccctctttga gcccaa 36 



<210> 237 

<211> 34 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 237 

gtgagatcta tcactccttg gcggagagcc agtc 



34 



<210> 238 

<211> 2502 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 238 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 

aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 360 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 

ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 
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aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 720 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 

agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 

gatcttctgg ccctggccgc cgccaggggg ggccgggtcc accgggcccc cgagccttat 1020 

aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 1080 

gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 1140 

ctggaccctt ccaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 1200 

gaggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gtgggggagg 1260 

cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cctttccgct 1320 

gtcctggccc acatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 1380 

tccctggagg tggccgggga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 1440 

caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctaggg 1500 

cttcccgcca tcggcaagac ggagaagacc ggcaagcgct ccaccagcgc cgccgtcctg 1560 

gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 1620 

aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 1740 

•aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 1800 

gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 

gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 1920 

acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2040 

caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 

gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccggggcac cgccgccgac 2280 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 
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ctccttcagg tccacgacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 
gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 
gaggtgggga taggggagga ctggctctcc gccaaggagt ga 2502 



<210> 


239 


<211> 


833 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


239 


Met Asn Ser Gly Met 


1 


5 


Leu Val Asp Gly Hie 




20 



10 15 



25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 . 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 

145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 



174 



wo 01/90337 



PCT/USOl/17086 



210 



215 



220 



Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala* Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Gly Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
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515 



520 



525 



He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 

645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 



Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 



Arg Val Lys Ser Val Arg Glu Ala 
740 

Pro Val Arg Gly Thr Ala Ala Asp 
755 760 

Leu Phe Pro Arg Leu Glu Glu Met 
770 775 

His Asp Glu Leu Val Leu Glu Ala 
785 790 

Ala Arg Leu Ala Lys Glu Val Met 
805 

Pro Leu Glu Val Glu Val Gly He 



Ala Glu Arg Met Ala Phe Asn Met 
745 750 

Leu Met Lys Leu Ala Met Val Lys 
765 

Gly Ala Arg Met Leu Leu Gin Val 
780 

Pro Lys Glu Arg Ala Glu Ala Val 
795 800 

Glu Gly Val- Tyr Pro Leu Ala Val 
810 815 

Gly Glu Asp Trp Leu Ser Ala Lys 
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820 



825 



830 



Glu 



<210> 


240 


<211> 


28 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


240 



cacgaattcc gaggcgatgc ttccgctc 



<210> 241 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 241 

tcgacgtcga ctaacccttg gcggaaagcc 



<210> 


242 


<211> 


23 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


242 



gcatcgcctc ggaattcatg gtc 



<210> 243 
<211> 836 
<212> PRT 

<213> Thermus thermophilus 
<400> 243 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 
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Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 

275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Asp Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 
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Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 

565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 
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Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro ^ Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 ' 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val His Asp Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly 
835 

<210> 244 

<211> 2511 

<212> DNA 

<213> Therraus thermophilus 

<400> 244 



atgaattccg aggcgatgct tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 


ggccaccacc tggcctaccg caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


gaaccggtgc aggcggtcta cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


gggtacaagg ccgtcttcgt ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


tacgaggcct acaaggcggg gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ctcatcaagg agctggtgga cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg ttctcgccac cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


atcctcaccg ccgaccgcga cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 
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ccc9«^999cc 


acctcatcac 


Gccggag'tgg 


Cui-tgggaga 




cActaccaaAa 
uciyyu*-yyay 


540 


cagcgggcgg 


accuccgcgc 


ccucgugggg 






caacratcaao 


600 


ggcatcgggg 


agaagaccgc 






3^ cici^ v.* ^ u> 


cicra.a.aacctc 


660 


ctCoiagaacc 


tggaccgggt 


aaagccagaa 




ar^a arTJ!4 1" a 


aaccc AOC^Q 


720 


gaagacctca 


ggctctcctt 


ggagcuctcc 


cggg^gcg^^ 


r« cr a f" ^ r» 


0 r» fc cf a a era t CT 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggccua 


ftnn^ ^ 4" t» ^ ^ 

gggccuuCwu 


yyayoyguuy 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


c ti c c t ggagg 


cccccgccww 


cctyycLyyay 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


uccuCucccg 


CCCCyciyccc 


7 0 w 


atgtgggcgg 


age 1 1 aaagc 


cccggccgcc 


cgcagggacg 


y ccy gy uy ca 




1020 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg 


gcc tec ccyc 


^ a a (~r<^ a r* ^ 
(.^ctciyycLi^(>^u>iM« 


1080 


gccgtcttgg 


cctcgaggga 


9939c cagac 


wtcgcgcccg 






1140 


gcctaccticc 


cggacccctc 


CaaCaCCaCC 


cccgaggggg 


t-yycyt-yyt-y 


^***^yyyyyy 


1200 


gagcggacgg 


aggacgccgc 


^ ^ 9 #v ^ 

ccaccgggcc 




«y ayy v.* 




1260 


cuuaagcgcc 


tcgaggggga 




r» t" i" I" cm r* t" f +■ 

^ L> L> L>^^I^ImW u 


a f» r* a r* a a crot" 


craaaaaaccc 


1320 


cccucccggg 


ccccggccca 


Cawggaggcc 


Awcgggguac 


ggcgyyacgt 


aacioY aec 1 1 

yy WWtvClwwiv l» 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


auccgccgcc 


tcgaggagya 


yy u uv^v-y 0 


1440 


ttggcgggcc 


accccttzcaa 


ccticaactcc 


cgggaccagc 


cggaaaggyt. 




1500 


gagcttaggc 


ttcccgcctt 


ggggaagacg 


caaaagacag 


gcaagcgci^c 


c ci\^ ay oy 0 ^ 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccaccgcgg 


agaagacccu 


CCay waww>yy 


1 620 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagccucy^ 


CCoiCCCyciyy 


1 6R0 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag 


gcbcagtagc 


J. t 


tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


tgggccagag 


gacccgccgg 


1 Ann 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg 


gtggccctgg 


actat^agcca 


gatagagctc 


xoou 


cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca 


gggtcttcca 


ggaggggaag 


1920 


gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc 


cgtggacccc 


1980 


ctgatgcgcc 


gggcggccaa 


gacggtgaac 


ttcggcgtcc 


tctacggcat 


gtccgcccat 


2040 


aggctctccc 


aggagcttgc 


catcccctac 


gaggaggcgg 


tggcctttat 


agagcgctac 


2100 


ttccaaagct 


tccccaaggt 


gcgggcctgg 


atagaaaaga 


ccctggagga 


ggggaggaag 


2160 
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c9Sr99ctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt ccacgacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtta g 2511 



<210> 245 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 245 

atagccatgg tggagcggcc gctctcccgg 30 



<210> 246 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 246 

aagcgtcgac tcaatcctgc ttcgcctcca gcc 



33 



<210> 247 

<211> 32 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 247 

aatcgaattc accccacttt ttgacctgga gg 32 



<210> 248 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
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<400> 248 



ccgggagagc ggccgctcca 


c 




21 


<210> 249 
<211> 2508 
<212> DNA 
<213> Artificial 








<220> 

<223> Synthetic 








<400> 249 
atggaattca 


ccccactttt 




aas^ccccGCA Acscaaa^csc^ tctcfCftoaa.c 


60 


ggccaccacc 


tggcctaccg 


r« a f* *^ t" t" r" a 1" 


cfpcct"ciaace tcacca.cctc ccaQCroiGQaQ 


120 


ccggtgcaga 


tggtctacgg 




Actcc1icct:ca. acfcrccttcraa. ooraooacocra 


180 


caggcggtgg 


tcgtggtctt 


ct^ ^ 




240 


gcctacaagg 


cgggccgggc 




aaaaacttcc cccaccaacl^ caccttaatc 


300 


aagcggctgg 


tggaccttct 


acicf e e t era fc f 


wy l»s^yciyy www^yyyywa v«v^ayy wyyaw 


360 


gacgtcctgg 


gcaccctggc 


w GiGivj Clay y v« ^ 


ciaaarmnanrt rfCfat"CTcraflcrt" ctr'or'at'f'ft'r' 

y eioiciy yy ciyy yy **^yy ^^yy y^yv»oiwv*^*v»^ 


420 


acgggagacc 


gggacttctt 




t* o r*rra na a nrr h"r«l"f^nCTt*r'r't* r*r'i'Csr*r'OCtriC* 
uwrLrycivjciciyy L. u. i» w y y u w w ww4»yi*Vii>yy&^* 


480 


gggaccctgg 


tcaccccaaa 


yy Gicy t ccd^ 


yeiyciay ucicy yyy t-yk^uu^v,* yyciyuyi.'Uyy 


540 


gtggacttcc 


gcgccctcac 




traaacaaca tccccaacrat Qcrccrcraaata 

^ v»y y R w c*ci wCfc ^ w v» w wy y y y i«» yy*»yyy3^**^ 


600 


ggggagaaga 


ccgcccttcg 


actcc tcaca 

t*W kf# W w w \^ ^ (A 


Qaatacrcracra crcatacia.aa.a. cctcctcraacr 

y^y^^yyyyy^ y ^y ^y^j**'-*^** ww^^wwwy***^^ 


660 


aacctggacc 


gggtaaagcc 


aaa.ctcoctic 

^ V* w w V^^K W W W 


caacacaaQa taaaoocaca cctcaaoQac 


720 


ctccacctct 


ccttagacct 


QCTCGCCfCatC 

y V* V* W « W \* 


CQcaccaacc tccccctOQa aatcraacttit 

Wy V W^4C»w w W WW W WW>^^^ ^^^^ W^*W \* W W 


780 


aaggccctgc 


gccgcaggac 


f f f fcia r» n i" Pf 
w^ci^^ uy 


aacicfcrf'ct'aa aacfccttfcfct crcracrcraacticf 
yciyyy wVi»uya yyywwk.i>uuu yy^yyayv* uy 


840 


gagttcggaa 


gcctcctcca 


^ys-y^^cyyc 


cccccgggag gggagaagcc ccgggaggay 




gccccctggc 


ccccgcccga 


aggggccttc 


gcgggcuccc cccccccccg caaggagccc 




atgtgggcgg 


agcttctggc 


cctggcggcg 


gcctcggagg gccgggtcca ccgggcEaca 


xuzu 


agcccggttg 


aggccctggc 


cgacctcaag 


gaggcccggg ggttcctggc caaggacctg 


1080 


gccgttttgg 


ccctgcggga 


gggggtggcc 


ctggacccca cggacgaccc cctcctggtg 


1140 


gcctacctcc 


tggacccggc 


caacacccac 


cccgaggggg tggcccggcg ctacgggggc 


1200 


gagttcacgg 


aggacgcagc 


ggagagggcc 


ctcctctccg agaggctctt ccagaacctc 


1260 


tttccccggc 


tttccgagaa 


gctcctctgg 


ctctaccagg aggtggagcg gcccctctcc 


1320 
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cgggtcttgg 


cccacatgga 


ggcccggggg gtgaggctgg 


acgtccccct tctggaggcc 


1380 


ctctcctttg 


agctggagaa 


ggagatggag cgcctggagg 


gggaggtctt ccgtttggcc 


1440 


ggccacccct 


tcaacctcaa 


ctcccgcgac cagctggaaa 


gggtcctctt tgacgagctg 


1500 


ggcctcaccc 


cggtgggccg 


gacggagaag acgggcaagc 


gctccaccgc ccagggggcc 


1560 


ctggaggccc 


tccggggggc 


ccaccccatc gtggagctca 


tcctccagta ccgggagctt 


1620 


tccaagctca 


aaagcaccta 


cctggacccc ctgccccggc 


tcgtccaccc gcggacgggc 


1680 


cggctccaca 


cccgcttcaa 


ccagacggcc acggccacgg 


gaaggctttc cagctccgac 


1740 


cccaacctgc 


agaacatccc 


cgtgcgcacc cccttggggc 


agcgcatccg caaggccttc 


1800 


gtggccgagg 


aggggtggct 


ccttttggcg gcggactact 


cccagattga gctccgggtc 


1860 


ctggcccacc 


tctcggggga 


cgagaacctg aagcgggtct 


tccgggaggg gaaggacatc 


1920 


cataccgaga 


ccgccgcctg 


gatgttcggc ttagaccccg 


ctctggtgga tccaaagatg 


1980 


cgccgggcgg 


ccaagacggt 


caacttcggc gtcctctacg 


ggatgtccgc ccacaggctc 


2040 


tcccaggagc 


tcggcataga 


ctacaaggag gcggaggcct 


ttattgagcg ctacttccag 


2100 


agcttcccca 


aggtgcgggc 


ctggatagaa aggaccctgg 


aggagggccg gacgcggggc 


2160 


tacgtggaga 


ccctgttcgg 


caggaggcgc tatgtgcccg 


acctggcctc ccgggtccgc 


2220 


^cggcgcggg 


aggcygcgga 


gcygauyyCC uuCaaCatgc 


ccgi^gco.ggg caccgcvgcc 


ou 


gacctgatga 


agatcgccat 


ggtcaagctc ttccccaggc 


taaagcccct gggggcccac 


2340 


ctcctcctcc 


aagtgcacga 


cgagctggtc ctggaggtgc 


ccgaggaccg ggccgaggag 


2400 


gccaaggccc 


tggtcaagga 


ggtcatggag aacgcctacc 


ccctggacgt gcccctcgag 


2460: 


gtggaggtgg 


gcgtgggtcg 


ggactggctg gaggcgaagc 


aggattga 


2508 



<210> 250 

<211> 835 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 250 

Met Glu Phe Thr Pro Leu Phe Asp Leu Glu Glu Pro Pro Lys Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Tyr Ala Leu 
20 25 30 
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ser Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Met Val Tyr Gly Phe 
35 40 45 

Ala Arg Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Gin Ala Val Val 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 
65 70 75 80 

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu Val Lys Arg Leu Val Asp Leu Leu Gly Leu Val Arg Leu 
100 105 110 

Glu Ala Pro Gly Tyr Glu Ala Asp Asp Val Leu Gly Thr Leu Ala Lys 
115 120 125 

Lys Ala Glu Arg Glu Gly Met Glu Val Arg lie Leu Thr Gly Asp Arg 
130 135 140 

Asp Phe Phe Gin Leu Leu Ser Glu Lys Val Ser Val Leu Leu Pro Asp 
145 150 155 160 

Gly thr Leu Val Thr Pro Lys Asp Val Gin Glu Lys Tyr Gly Val Pro 
165 170 175 

Pro Glu Arg Trp Val Asp Phe Arg Ala Leu Thr Gly Asp Arg Ser Asp 
180 185 190 

Asn lie Pro Gly Val Ala Gly lie Gly Glu Lys Thr Ala Leu Arg Leu 
195 200 205 

Leu Ala Glu Trp Gly Ser Val Glu Asn Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Val Lys Pro Asp Ser Leu Arg Arg Lys lie Glu Ala His Leu Glu Asp 
225 230 235 240 

Leu His Leu Ser Leu Asp Leu Ala Arg lie Arg Thr Asp Leu Pro Leu 

245 250 255 

Glu Val Asp Phe Lys Ala Leu Arg Arg Arg Thr Pro Asp Leu Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Glu Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 



Phe Gly Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu Ser Arg Lys Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Leu Ala Leu Ala Ala Ala Ser Glu Gly Arg Val 
325 330 335 
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His Arg Ala Thr Ser Pro Val Glu Ala Leu Ala Asp Leu Lys Glu Ala 
340 345 350 

Arg Gly Phe Leu Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Qly 
355 360 365 

Val Ala Leu Asp Pro Thr Asp Asp Pro Leu Leu Val Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ala Asn Thr His Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Phe Thr Glu Asp Ala Ala Glu Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

Phe Gin Asn Leu Phe Pro Arg Leu Ser Glu Lys Leu Leu Trp Leu Tyr 
420 425 430 

Gin Glu Val Glu Arg Pro Leu Ser Arg Val Leu Ala His Met Glu Ala 
435 440 445 

Arg Gly Val Arg Leu Asp Val Pro Leu Leu Glu Ala Leu Ser Phe Glu 
450 455 460 

Leu Glu Lys Glu Met Glu Arg Leu Glu Gly Glu Val Phe Arg Leu Ala 
465 470 475 480 

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 
485 490 495 

Phe Asp Glu Leu Gly Leu Thr Pro Val Gly Arg Thr Glu Lys Thr Gly 
500 505 510 

Lys Arg Ser Thr Ala Gin Gly Ala Leu Glu Ala Leu Arg Gly Ala His 
515 520 525 

Pro lie Val Glu Leu lie Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys 
530 535 540 

Ser Thr Tyr Leu Asp Pro Leu Pro Arg Leu Val His Pro Arg Thr Gly 
545 550 555 560 

Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu 
565 570 575 

Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu 
580 585 590 

Gly Gin Arg lie Arg Lys Ala Phe Val Ala Glu Glu Gly Trp Leu Leu 
595 600 605 

Leu Ala Ala Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu 
610 615 620 



Ser Gly Asp Glu Asn Lgu Lys Arg Val Phe Arg Glu Gly Lys Asp lie 
625 630 635 640 
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His Thr Glu Thr Ala Ala Trp Met Phe Gly Leu Asp Pro Ala Leu Val 
645 650 655 

Asp Pro Lys Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu 
660 665 670 

Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Gly lie Asp Tyr 
675 680 685 

Lys Glu Ala Glu Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys 
690 695 700 

Val Arg Ala Trp lie Glu Arg Thr Leu Glu Glu Gly Arg Thr Arg Gly 
705 710 715 720 

Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Ala 
725 730 735 

Ser Arg Val Arg Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
740 745 750 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys He Ala Met Val 
755 760 765 

Lys Leu Phe Pro Arg Leu Lys Pro Leu Gly Ala His Leu Leu Leu Gin 
770 775 780 

Val His Asp Glu Leu Val Leu Glu Val Pro Glu Asp Arg Ala Glu Glu 
785 790 795 800 

Ala Lys Ala Leu Val Lys Glu Val Met Glu Asn Ala Tyr Pro Leu Asp 
805 810 815 

Val Pro Leu Glu Val Glu Val Gly Val Gly Arg Asp Trp Leu Glu Ala 
820 825 830 

Lys Gin Asp 
835 



<210> 


251 


<211> 


31 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


251 



actggaattc ctgcccctct ttgagcccaa g 31 



<210> 252 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 
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<223> Synthetic 
<400> 252 

aacagtcgac ctaggccttg gcggaaagcc 



30 



<210> 253 
<211> 2499 
<212> DNA 
<2X3> Artificial 

<220> 

<223> Synthetic 
<400> 253 

atggaattcc tgcccctctt tgagcccaag ggccgggtgc ttctggtgga cggccaccac 60 

ctggcctacc gtaccttttt tgccctgaag ggcctcacca .ccagccgcgg ggagccggtc 120 

caggcggtgt acgggtttgc caagagcctt ttgaaggcgc taagggaaga cggggatgtg 180 

gtgatcgtgg tgtttgacgc caaggccccc tccttccgcc accagaccta cgaggcctac 240 

aaggcggggc gggctcccac ccccgaggac tttccccggc agcttgccct tatcaaggag 300 

atggtggacc ttttgggcct ggagcgcctc gaggtgccgg gctttgaagc ggatgacgtc 360 

ctggctaccc tggccaagaa ggcggaaaag gaaggctacg aagtgcgcat cctcaccgcg 420 

gaccgggacc tttaccagct tctttcggag cgaatctcca tccttcaccc ggagggttac 480 

ctgatcaccc cggagtggct ttgggagaag tatgggctta agccttccca gtgggtggac 54 0 

taccgggcct tggccgggga cccttccgac aacatccccg gcgtgaaggg catcggggag 600 

aagacggcgg ccaagctgat ccgggagtgg ggaagcctgg aaaaccttct taagcacctg 660 

gaacaggtga aacctgcctc cgtgcgggag aagatcctta gccacatgga ggacctcaag 720 

ctatccctgg agctatcccg ggtgcacacg gacttgctcc ttcaggtgga cttcgcccgg 780 

cgccgggagc cggaccggga ggggcttaag gcctttttgg agaggctgga gttcggaagc 840 

ctcctccacg agttcggcct gttggaaagc ccggtggcgg cggaggaagc tccctggccg 900 

ccccccgagg gagccttcgt ggggtacgtt ctttcccgcc ccgagcccat gtgggcggag 960 

cttaacgcct tggccgccgc ctgggaggga agggtttacc gggcggagga tcccttggag 1020 

gccttgcggg ggcttgggga ggtgaggggg cttttggcca aggacctggc ggtgctggcc 1080 

ctgagggaag ggattgccct ggcaccgggc gacgacccca tgctcctcgc ctacctcctg 1140 

gatccttcca acaccgcccc cgaaggggta gcccggcgct acggggggga gtggaccgag 1200 

gaggcggggg aaagggcgct gctttccgaa aggctttacg ccgccctcct ggagcggctt 1260 
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aagggggagg agaggcttct ttggctttac gaggaggtgg aaaagcccct ttcgcgggtc 1320 

ctggcccaca tggaggccac gggggtacgg ttggatgtgg cctacttaaa ggccctttcc 1380 

ctggaggtgg aggcggagat aaggcgcttc gaggaggagg tccaccgcct ggccgggcat 1440 

cctttcaacc tgaactcccg ggaccagctg gaaagggtca tctttgacga gcttgggctt 1500 

cccgccatcg gcaagacgga gaagacgggc aagcgctcca ccagcgccgc cgttttggag 1560 

gccttgcggg aggctcatcc catcgtggac cgcatccttc agtaccggga gctttccaag 1620 

ctcaagggaa cctacatcga tcccttgcct gccctggtcc accccaagac gaaccgcctc 1680 

cacacccgtt tcaaccagac ggccaccgcc acggggaggc ttagcagctc ggatcctaat 174 0 

ctgcaaaata tccccgtgcg cacccctttg ggccagcgga tccgccgggc cttcgtggcc 1800 

gaggaggggt ggaggctggt ggttttggac tacagccaga ttgagctcag ggtcctggcg 1860 

cacctttccg gggacgagaa cctaatccgg gtcttccagg agggccagga catccacacc 1920 

cagacggcca gctggatgtt cggcgtgccc ccagaggccg tggattccct gatgcgccgg 1980 

gcggccaaga ccatcaactt cggcgtcctc tacggcatgt ccgcccaccg gctttcggga 2040 

gagctggcca tcccctacga ggaggcggtg gccttcatcg agcggtattt ccagagctac 2100 

cccaaggtgc gggcctggat tgagaaaacc ctggcggaag gacgggaacg gggctatgtg 2160 

gaaaccctct ttggccgccg gcgctacgtg cccgacttgg cttcccgggt gaagagcatc 2220 

cgggaggcag cggagcgcat ggccttcaac atgccggtcc aggggaccgc cgcggatttg 2280 

atgaaactgg ccatggtgaa gctctttccc aggcttcagg agctgggggc caggatgctt 2340 

ttgcaggtgc acgacgaact ggtcctcgag gctcccaagg agcaagcgga ggaagtcgcc 2400 

caggaggcca agcggaccat ggaggaggtg tggcccctga aggtgccctt ggaggtggaa 2460 

gtgggcatcg gggaggactg gctttccgcc aaggcctag 2499 



<210> 254 

<211> 832 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 254 

Met Glu Phe Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
15 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
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20 



25 



30 



Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val lie Val Val 
50 55 60 

Phe Asp Ala Lys Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu lie Lys Glu Met Val Asp Leu Leu Gly Leu Glu Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Arg Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Glu Arg He Ser He Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu Lys Pro Ser 
165 170 175 

Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser Asp Asn He 
180 185 190 

Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Ala Lys Leu He Arg 
195 200 205 

Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys His Leu Glu Gin Val Lys 
210 215 220 

Pro Ala Ser Val Arg Glu Lys He Leu Ser His Met Glu Asp Leu Lys 
225 230 235 240 

Leu Ser Leu Glu Leu Ser Arg Val His Thr Asp Leu Leu Leu Gin Val 
245 250 255 

Asp Phe Ala Arg Arg Arg Glu Pro Asp Arg Glu Gly Leu Lys Ala Phe 
260 265 270 

Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu 
275 280 285 

Glu Ser Pro Val Ala Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 
290 295 300 

Ala Phe Val Gly Tyr Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu 
305 310 315 320 



Leu Asn Ala Leu Ala Ala Ala Trp Glu Gly Arg Val Tyr Arg Ala Glu 
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325 



330 



335 



Asp Pro Leu Glu Ala Leu Arg Gly Leu Gly Glu Val Arg Gly Leu Leu 
340 345 350 

Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly He Ala Leu Ala 
355 360 365 

Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 
370 375 380 

Thr Ala Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu 
385 390 395 400 

Glu Ala Gly Glu Arg Ala Leu Leu Ser Glu Arg Leu Tyr Ala Ala Leu 
405 410 415 

Leu Glu Arg Leu Lys Gly Glu Glu Arg Leu Leu Trp Leu Tyr Glu Glu 
420 425 430 

Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr Gly 
435 440 445 

Val Arg Leu Asp Val Ala Tyr Leu Lys Ala Leu Ser Leu Glu Val Glu 
450 455 460 

Ala Glu He Arg Arg Phe Glu Glu Glu Val His Arg Leu Ala Gly His 
465 470 475 480 

Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val He Phe Asp 
485 490 495 

Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys Arg 
500 505 510 

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro He 
515 520 525 

Val Asp Arg He Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys Gly Thr 
530 535 540 

Tyr He Asp Pro Leu Pro Ala Leu Val His Pro Lys Thr Asn Arg Leu 
545 550 555 560 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 570 575 

Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin 
580 585 590 

Arg He Arg Arg Ala Phe Val Ala Glu Glu Gly Trp Arg Leu Val Val 
595 600 605 

Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 615 620 

Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Gin Asp He His Thr 
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625 630 635 640 

Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp Ser 

645 650 655 

Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr Gly 
660 665 670 

Met Ser Ala His Arg Leu Ser Gly Glu Leu Ala lie Pro Tyr Glu Glu 
675 680 685 

Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Tyr Pro Lys Val Arg 
690 695 700 

Ala Trp He Glu Lys Thr Leu Ala Glu Gly Arg Glu Arg Gly Tyr Val 
705 710 715 720 

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Ala Ser Arg 
725 730 735 

Val Lys Ser He Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 
740 745 750 

Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu 
755 760 765 

Phe Pro Arg Leu Gin Glu Leu Gly Ala Arg Met Leu Leu Gin Val His 
770 775 780 

Asp Glu Leu Val Leu Glu Ala Pro Lys Glu Gin Ala Glu Glu Val Ala 
785 790 795 800 

Gin Glu Ala Lys Arg Thr Met Glu Glu Val Trp Pro Leu Lys Val Pro 
805 810 815 

Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys Ala 
820 825 830 

<210> 255 
<211> 20 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 

<400> 255 

cgatctcctc ggccacctcc 20 



<210> 256 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
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<400> 256 

ggcggtgccc tggacgggca 



<210> 257 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



20 



<400> 257 

ccagctcgtt gtggacctga 20 



<210> 258 

<211> 2505 

<212> DNA 

<213> Thermus aquaticus 

<400> 258 



atgaattcgg 


ggatgctgcc 


cctctttgag 


I N j-j, -L--i-i_r-rii--nL-ra J ■ j uJ ■ j ■ -^t jni ji j~l j~i i~ ^T^V ^ A ^ 

cccaagggcc gggcccccct ggcggacggc 


o w 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc tcaccaccag ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca aggccctcaa ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct tccgccacga ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc cccggcaact cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg tcccgggcta cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg gctacgaggt ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca tccacgtcct ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg gcctgaggcc cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc ttcccggggt caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga gcctggaagc cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc tggcccacat ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc ccctggaggt ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc tggagaggct tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg ccctggagga ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc gcaaggagcc catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggg 


ggccgggtcc accgggcccc cgagccttat 


1020 
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aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct cgcctacctc 


1140 


ctggaccctt 


ccaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct gtgggggagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc cctttccgct 


1320 


gtcctggccc 


acatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


ggagaagacc 


ggcaagcgct 


ccaccagcgc cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg ggtgaagagc 


2220 




^aa^^a^a^a 


^ nt^/^ t" f~ 

wAuaa^^^ 


aaCa ua^^^a 


uwcdaaa^^^ wa^^a^^a^^ 


^ A O V 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg ggccaggatg 


2340 


ctccttcagg 


tccacaacga 


gctggtcctc 


gaggccccaa 


aagagagggc ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagt 


gatag 


2505 



<210> 259 

<211> 833 

<212> PRT 

<213> Thermus aquaticus 

<400> 259 
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Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 
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Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 

565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 565 590 

Gin Arg lie Arg Arg Ala Phe lie Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 
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Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 



Glu 



<210> 260 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 260 

caggaggagc tcgttgtgga cctgga 
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<210> 


261 


<211> 


836 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


261 


Met Asn Ser Glu Ala 


1 


5 


Leu Leu Val Asp Gly 




20 



10 15 



25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lye Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
65 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp. Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
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245 



250 255 



Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Qlu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Asp Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Qlu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin TQa Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
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545 



550 



555 



550 



Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 

625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 . 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 

740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val His Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly 
835 



<210> 262 
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<211> 2511 

<212> DNA 

<:213> Artificial 

<220> 

<223> Synthetic 
<400> 262 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 

ggccaccafcc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120 

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 

gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240 

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 300 

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360 

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 420 

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 480 

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 540 

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 600 

ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660 

ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 720 

gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 780 

gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 840 

gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 900 

gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960 

atgtgggcgg agcttaaagc cctggccgcc tgcagggacg gccgggtgca ccgggcagca 1020 

gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080 

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140 

gcctacctcc tggacccctc caacaccacc cccgaggggg tggcgcggcg ctacgggggg 1200 

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 1260 

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 1320 

ctctcccggg tcctggccca catggaggcc accggggtac ggcgggacgt ggcctacctt 1380 

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500 
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gagcttaggc 


ttcccgcctt 


9999^39^cg caaaagacag 


gcaagcgctc 


caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg gaccccctcc 


caagcctcgt 


ccacccgagg 


* 1680 


acgggccgcc 


tccacacccg 


cttcaaccag acggccacgg 


ccacggggag 


gcttagtagc 


1740 


tccgacccca 


acctgcagaa 


catccccgtc cgcaccccct 


tgggccagag 


gatccgccgg 


1800 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg gtggccctgg 


actatagcca 


gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


cggggacgaa aacctgatca 


gggtcttcca 


ggaggggaag 


1920 


gacatccaca 


cccagaccgc 


aagctggatg ttcggcgtcc 


ccccggaggc 


cgtggacccc 


1980 


ctgatgcgcc 


gggcggccaa 


gacggtgaac ttcggcgtcc 


tctacggcat 


gtccgcccat 


2040 


aggctctccc 


aggagcttgc 


catcccctac gaggaggcgg 


tggcctttat 


agagcgctac 


2100 


ttccaaagct 


tccccaaggt 


gcgggcctgg atagaaaaga 


ccctggagga 


ggggaggaag 


2160 


cggggctacg 


tggaaaccct 


cttcggaaga aggcgctacg 


tgcccgacct 


caacgcccgg 


2220 


gtgaagagcg 


tcagggaggc 


cgcggagcgc atggccttca 


acatgcccgt 


ccagggcacc 


2280 


gccgccgacc 


tcatgaagct 


cgccatggtg aagctcttcc 


cccgcctccg 


ggagatgggg 


2340 


gcccgcatgc 


tcctccaggt 


ccacaacgag ctcctcctgg 


aggcccccca 


agcgcgggcc 


2400 


gaggaggtgg 


cggctttggc 


caaggaggcc atggagaagg 


cctatcccct 


cgccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


gggggaggac tggctttccg 


ccaagggtta 


g 


2511 



<210> 263 

<211> 50 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 263 

tctagaggat ctatcagtgg tggtggtggt ggtgctcctt ggcggagagc 50 

<210> 264 

<211> 58 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 264 
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tgcctgcagg tcgacgctag ctagtggtgg tggtggtggt gacccttggc ggaaagcc 58 

<210> 265 
<211> 2517 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 265 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 

aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 360 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 54 0 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 

ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 720 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 

agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 

gatcttctgg ccctggccgc cgccaggggg ggccgggtcc accgggcccc cgagccttat 1020 

aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 1080 

gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 1140 

ctggaccctt ccaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 1200 

gaggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gtgggggagg 1260 

cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cctttccgct 1320 

gtcctggccc acatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 1380 
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tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


ggagaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


gSftggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


C99gtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tccacaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 266 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 266 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
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35 



40 



45 



Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
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340 



345 



350 



Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 



Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
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645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 BOO 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 267 

<211> 2526 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 267 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 
ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120 
gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 
gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240 
tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 300 
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ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcagggacg 


gccgggtgca 


ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccctc 


caacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


catggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


gagcttaggc 


ttcccgcctt 


ggggaagacg 


caaaagacag 


gcaagcgctc 


caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag 


gcttagtagc 


1740 








6iW w w V« \^ 


ticrciGrcca.aa.cf 


L»Vrf 


1800 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg 


gtggccctgg 


actatagcca 


gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca 


gggtcttcca 


ggaggggaag 


1920 


gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc 


cgtggacccc 


1980 


ctgatgcgcc 


gggcggccaa 


gacggtgaac 


ttcggcgtcc 


tctacggcat 


gtccgcccat 


2040 
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2526 



aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt ccacaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 2520 
caccac 



<210> 268 

<211> 842 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 268 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 
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Asp Arg Asp Leu Tyx Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Asp Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 

420 425 430 



Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 
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Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg. Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 
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Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val His Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 



<210> 


269 


<211> 


30 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


269 



gccgccaggg gcggccgcgt ccaccgggcc 30 



<210> 270 

<211> 31 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 270 

gcctgcaggg gcggccgcgt gcaccggggc a 31 



<210> 


271 


<211> 


26 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


271 



ctcctggacc cttcgaacac cacccc 26 



<210> 272 
<211> 23 
<212> DNA 
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<213> Artificial 
<220> 

<223> Synthetic 
<400> 272 

gtcctggccc atatggaggc cac 



<210> 273 

<211> 2526 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



23 



<400> 273 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 
ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 
gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 
gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 
tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 
ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 
gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 
atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 
cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 
cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 
ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 
ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 
gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 
gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 
gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 
gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 
atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtgca ccgggcagca 1020 
gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080 
gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140 
gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 1200 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
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gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 1260 

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 1320 

ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 1380 

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500 

gagcttaggc ttcccgcctt ggggaagacg caaaagacag gcaagcgctc caccagcgcc 1560 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc * 1860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2260 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt ccacaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 2520 

caccac 2526 

<210> 274 
<211> 842 
<212> PRT 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 274 
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Met Asn Ser Glu Ala Met Leu Pro Leu Phe Qlu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
lis 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arq Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 
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Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Tip Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 

565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 



Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 
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Ala lieu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 

740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val His Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 



<210> 275 

<211> 2517 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 275 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 
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caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gtgggggagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 










^ ^ a ^ a r% r^r* 
\» Vjel v« ^ ^ O 


eg ccg uCCuy 




gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccG 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 
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1860 
1920 
1980 
2040 



gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 
gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 
acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 
cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 

caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 

gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 

ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 

gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 276 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 276 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 
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Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 

340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 



Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 
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Leu Trp Qly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Qly Lys Thr Glu Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 
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Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe • Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 


277 


<211> 


27 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


277 



gaggaggcgg ggcaccgggc cgccctt 27 



<210> 


278 


<211> 


36 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


278 



ctttccgaga ggctccatcg gaacctgtgg gggagg 36 



<210> 279 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 279 
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ctcttcgcca acctgcttaa gaggcttgag ggggag 36 



<210> 280 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 280 

aggccccttt cccgggtcct ggcccat 27 



<210> 281 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 281 

acgggggtgc gccgggacgt ggcctat 27 



<210> 282 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 282 

gtggcctatc tccaggcctt gtccctg 27 



<210> 283 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 283 

ttgtccctgg agcttgccga ggagatc 27 



<210> 284 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 
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<223> Synthetic 

<400> 284 

gccgaggaga tccgccgcct cgaggcc 27 



<210> 285 

<211> 27 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 



<400> 285 

gcccgcctcg aggaggaggt cttccgc 27 



<210> 286 

<211> 27 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 286 

tttgacgagc taaggcttcc cgccatc 27 



<210> 287 

<211> 27 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 287 

atcgccaaga cgcaaaagac cggcaag 27 



<210> 288 

<211> 27 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 288 

aagatcctgc agcaccggga gctcacc 27 



<210> 289 
<211> 27 
<212> DNA 
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<213> Artificial 
<220> 

<223> Synthetic 
<400> 289 

accaagctga agaacaccta cattgac 



<210> 290 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 290 

aagagcacct acgtggaccc cttgccg 



<210> 291 

<211> 39 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 291 

attgacccct tgccgagcct cgtccacccc aggacgggc 39 



<210> 292 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 292 

ttcgccaacc tgcttgggag gcttgagggg gag 33 



<210> 293 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 293 

ctcttcgcca acctgtggaa gaggcttgag ggg 33 
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<210> 294 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 294 

gcttgcggtc tgggtggcga tgtccttccc etc 



<210> 295 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 295 

catgttgaag gccatggcct ccgcggcctc cct 



<210> 296 

<211> 30 

<212> DNA 

<213> Artificial 



33 



33 



<220> 

<223> Synthetic 

<400> 296 

caggaggagc tcgttggcga cctggaggag 30 

<210> 297 

<211> 45 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 297 

ggagcgcttg cctgtcttct tcgtcttctt caaggcggga ggcct 45 

<210> 298 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 298 
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gaggaccagc tcgttggcga cctgaaggag cat 



<210> 299 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 299 

gaggggcggg acatcgccac ggagaccgcc age 



<210> 300 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 300 

cagaacatcc ccgtcgccac cccgcttggg cag 



<210> 301 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 301 

gggcttcccg ccatcaagaa gacggagaag acc 



<210> 302 

<211> 39 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 302 

ctagggcttc ccgccatcaa gaagacgcaa aagaccggc 



<210> 303 

<211> 39 

<212> DNA 

<213> Artificial 

<220> 
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<223> Synthetic 
<400> 303 

ccggggaaag tcctcctccg tctcggcccg gcccgcctt 39 



<210> 304 

<2X1> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 304 

cgggacctcg aggcgcgtga accccaggag gtccac 36 

<210> 305 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 305 

ctcctccacg agttcggc 18 



<210> 306 

<211> 48 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 306 

accggtcttc ttcgtcttct tcaacttggg aagcctgagc tcgtcaaa 48 



<210> 307 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 307 

aagacgaaga agaccggtaa gcgctccacc age 

<210> 308 

<211> 52 

<212> DNA 



33 
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<213> Artificial 
<220> 

<223> Synthetic 

<400> 308 ^ 
gtcgactcta gatcagtggt ggtggtggtg gtgcttggcc gcccggcgca tc 



<210> 309 

<211> 60 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<220> 

<221> modified_base 

<222> (19) . . (42) . x: 4.1,^ K^oo 

<223> The bases in these positions within tH primer are 91% of the base 
shown and 3% each of the other 3 nucleotides 



<400> 309 

ggagcgctta ccggtctttt gcgtcttctt gatcttggga agccttagct cgtcaaagag 60 



<210> 310 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 310 
ctcctccacg agttcggc 



<210> 311 

<211> 60 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<220> 

<221> modif ied_base 

<222> (19).. (42) . ^^o. 4= +.1,^ K=eo 

<223> The bases at these positions within tH primer are 91% of the base 
shown and 3% each of the other 3 nucleotides. 



<400> 311 ^ 
caaaagaccg gtaagcgctc caccagcgcc gccgtcctgg aggccctccg cgaggcccac 5u 
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<210> 312 

<211> 52 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 312 

gtcgactcta gatcagtggt ggtggtggtg gtgcttggcc gcccggcgca tc 52 



<210> 313 

<211> 21 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 313 

gtcggagggg tcccccacga g 21 



<210> 314 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 314 

tgtggaattg tgagcgg 17 



<210> 315 

<211> 75 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<220> 

<22l> modif ied_base 

<222> (28)., (59) 

<223> The bases in these positions within tH primer are 91% of the base 
shown and 3% each of the other 3 nucleotides. 



<400> 315 

ctcgtggggg acccctccga caacctcccc ggggtcaagg gcatcgggga gaagaccgcc 60 
ctcaagcttc tcaag 75 
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<210> 316 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 316 

gtggcctcca tatgggccag gac 



<210> 317 

<211> 36 

<212> DNA 

<:213> Artificial 

<220> 

<223> Synthetic 

<400> 317 

cgggacctcg aggcgcgtga accccaggag gtccac 



<210> 318 

<211> 39 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 318 

ccggggaaag tcctcctccg tctcggcccg gcccgcctt 



<:210> 319 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 319 

gtcggactcg tcaccggtca gggc 



<210> 320 

<211> 75 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
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<220> 

<221> modif ied_base 
<222> (28).. (60) 

<223> The bases in these positions within tH primer are 91% of the base 
shown and 3% each o£ the other 3 nucleotides. 



<400> 320 

ctgaccggtg acgagtccga caaccttccc ggggtcaagg gcatcgggga gaggacggcg 60 
aggaagcttc tggag 75 



<210> 321 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 321 

tagctcctgg gagagggcgt gggccgacat gcc 33 



<210> 322 

<211> 34 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 322 

cttccagaac ctctttaaac ggctttccga gaag 34 



<210> 323 

<211> 34 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 323 

cttctcggaa agccgtttaa agaggttctg gaag 34 



<210> 324 

<211> 31 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
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<400> 324 

ccggtgggcc ggacgcagaa gacgggcaag c 31 



<210> 325 

<211> 31 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 325 

gcttgcccgt cttctgcgtc cggcccaccg g 31 



<210> 326 

<211> 31 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 326 

ctcctccaag tgcacaacga gctggtcctg g 31 



<210> 327 

<211> 31 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 327 

ccaggaccag ctcgttgtgc acttggagga g 31 



<210> 328 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 328 

gccgccctcc tgaagcggct taaggg 26 



<210> 329 

<211> 26 

<212> DNA 

<213> Artificial 
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<220> 

<223> Synthetic 
<400> 329 

cccttaagcc gcttcaggag ggcggc 26 



<210> 330 

<211> 28 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 330 

atcggcaaga cgcagaagac gggcaagc 28 



<210> 331 

<211> 28 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 331 

gcttgcccgt cttctgcgtc ttgccgat 28 



<210> 332 

<;211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 332 

ttgcaggtgc acaacgaact ggtcctc 27 



<:210> 333 

<211> 27 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 



<400> 333 

gaggaccagt tcgttgtgca cctgcaa 27 



<210> 334 
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<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 334 

cagaccatga attccacccc actttttgac ctggag 



<210> 335 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 335 

gtggacgcgg ccgcccgagg ccgccgccag ggccag 



<210> 336 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 336 

cagaccatga attccctgcc cctctttgag cccaag 



<210> 337 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 337 

gtaaaccgcg ccgccccagg cggcggccaa ggcgtt 



<210> 338 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 338 

gaggtggagc ggcccctctc ccgggtcttg 
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<210> 339 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

< 2 2 3 > Synt he t ic 

<400> 339 

caagacccgg gagaggggcc gctccacctc 30 



<210> 340 
<2ai> 2505 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 340 

atggaattca ccccactttt tgacctggag gaacccccca agcgggtgct tctggtggac 60 

ggccaccacc tggcctaccg caccttctat gccctgagcc tcaccacctc ccggggggag 120 

ccggtgcaga tggtctacgg cttcgcccgg agcctcctca aggccttgaa ggaggacgga 180 

caggcggtgg tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacgag 240 

gcctacaagg cgggccgggc ccccaccccg gaggacttcc cccgccagct cgccttggtc 3 00 

aagcggctgg tggaccttct gggcctggtc cgcctcgagg ccccggggta cgaggcggac 360 

gacgtcctgg gcaccctggc caagaaggcc gaaagggagg ggatggaggt gcgcatcctc 420 

acgggagacc gggacttctt ccagctcctc tccgagaagg tctcggtcct cctgccggac 480 

gggaccctgg tcaccccaaa ggacgtccag gagaagtacg gggtgccccc ggagcgctgg 540 

gtggacttcc gcgccctcac gggggaccgc tcggacaaca tccccggggt ggcggggata 600 

ggggagaaga ccgcccttcg actcctcgca gagtggggga gcgtggaaaa cctcctgaag 660 

aacctggacc gggtaaagcc ggactcgctc cggcgcaaga tagaggcgca cctcgaggac 720 

ctccacctct ccttagacct ggcccgcatc cgcaccgacc tccccctgga ggtggacttt 780 

aaggccctgc gccgcaggac ccccgacctg gagggcctga gggccttttt ggaggagctg ' 840 

gagttcggaa gcctcctcca cgagttcggc ctcctgggag gggagaagcc ccgggaggag 900 

gccccctggc ccccgcccga aggggccttc gtgggcttcc tcctttcccg caaggagccc 960 

atgtgggcgg agcttctggc cctggcggcg gcctcggagg gccgggtcca ccgggcaaca 1020 
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agcccggttg aggccctggc cgacctcaag gaggcccggg ggttcctggc caaggacctg 1080 

gccgttttgg ccctgcggga gggggtggcc ctggacccca cggacgaccc cctcctggtg 1140 

gcctacctcc tggacccggc caacacccac cccgaggggg tggcccggcg ctacgggggc 1200 

gagttcacgg aggacgcagc ggagagggcc ctcctctccg agaggctctt ccagaacctc 1260 

tttaaacggc tttccgagaa gctcctctgg ctctaccagg aggtggagcg gcccctctcc 1320 

cgggtcttgg cccacatgga ggcccggggg gtgaggctgg acgtccccct tctggaggcc 1380 

ctctcctttg agctggagaa ggagatggag cgcctggagg gggaggtctt ccgtttggcc 1440 

ggccacccct tcaacctcaa ctcccgcgac cagctggaaa gggtcctctt tgacgagctg 1500 

ggcctcaccc cggtgggccg gacgcagaag acgggcaagc gctccaccgc ccagggggcc 1560 

ctggaggccc tccggggggc ccaccccatc gtggagctca tcctccagta ccgggagctt 1620 

tccaagctca aaagcaccta cctggacccc ctgccccggc tcgtccaccc gcggacgggc 1680 

cggctccaca cccgcttcaa ccagacggcc acggccacgg gaaggctttc cagctccgac 1740 

cccaacctgc agaacatccc cgtgcgcacc cccttggggc agcgcatccg caaggccttc 1800 

gtggccgagg aggggtggct ccttttggcg gcggactact cccagattga gctccgggtc 1860 
ctggcccacc tctcggggga cgagaacctg aagcgggtct tccgggaggg gaaggacatc 1920 
cataccgaga ccgccgcctg gatgttcggc ttagaccccg ctctggtgga tccaaagatg 1980 
cgccgggcgg ccaagacggt caacttcggc gtcctctacg ggatgtccgc ccacaggctc 2040 
tcccaggagc tcggcataga ctacaaggag gcggaggcct ttattgagcg ctacttccag 2100 
agcttcccca aggtgcgggc ctggatagaa aggaccctgg aggagggccg gacgcggggc 2160 
tacgtggaga ccctgttcgg caggaggcgc tatgtgcccg acctggcctc ccgggtccgc 2220 
tcggtgcggg aggcggcgga gcggatggcc ttcaacatgc ccgtgcaggg caccgccgcc 2280 
gacctgatga agatcgccat ggtcaagctc ttccccaggc taaagcccct gggggcccac 2340 
ctcctcctcc aagtgcacaa cgagctggtc ctggaggtgc ccgaggaccg ggccgaggag 2400 
gccaaggccc tggtcaagga ggtcatggag aacgcctacc ccctggacgt gcccctcgag 2460 
gtggaggtgg gcgtgggtcg ggactggctg gaggcgaagc aggat 2505 



<210> 341 

<211> 835 

<212> PRT 

<213> Artificial 

<220> 
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<223> Synthetic 
<400> 341 

Met Glu Phe Thr Pro lieu Phe Asp Leu Glu Glu Pro Pro Lys Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Tyr Ala Leu 
20 25 30 

Ser Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Met Val Tyr Gly Phe 
35 40 45 

Ala Arg Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Gin Ala Val Val 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 
65 70 75 80 

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu Val Lys Arg Leu Val Asp Leu Leu Gly Leu Val Arg Leu 
100 105 110 

Glu Ala Pro Gly Tyr Glu Ala Asp Asp Val Leu Gly Thr Leu Ala Lys 
115 120 125 

Lys Ala Glu Arg Glu Gly Met Glu Val Arg lie Leu Thr Gly Asp Arg 
130 135 140 

Asp Phe Phe Gin Leu Leu Ser Glu Lys Val Ser Val Leu Leu Pro Asp 
145 150 155 160 

Gly Thr Leu Val Thr Pro Lys Asp Val Gin Glu Lys Tyr Gly Val Pro 
165 170 175 

Pro Glu Arg Trp Val Asp Phe Arg Ala Leu Thr Gly Asp Arg Ser Asp 

180 185 190 

Asn lie Pro Gly Val Ala Gly lie Gly Glu Lys Thr Ala Leu Arg Leu 
195 200 205 

Leu Ala Glu Trp Gly Ser Val Glu Asn Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Val Lys Pro Asp Ser Leu Arg Arg Lys He Glu Ala His Leu Glu Asp 
225 230 235 240 

Leu His Leu Ser Leu Asp Leu Ala Arg He Arg Thr Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Phe Lys Ala Leu Arg Arg Arg Thr Pro Asp Leu Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Glu Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 
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Phe Gly Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro 
290 295 aoo 

Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu Ser Arg Lys Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Leu Ala Leu Ala Ala Ala Ser Glu Gly Arg Val 
325 330 335 

His Arg Ala Thr Ser Pro Val Glu Ala Leu Ala Asp Leu Lys Glu Ala 
340 345 350 

Arg Gly Phe Leu Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly 
355 360 365 

Val Ala Leu Asp Pro Thr Asp Asp Pro Leu Leu Val Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ala Asn Thr His Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Phe Thr Glu Asp Ala Ala Glu Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

Phe Gin Asn Leu Phe Lys Arg Leu Ser Glu Lys Leu Leu Trp Leu Tyr 
420 425 430 

Gin Glu Val Glu Arg Pro Leu Ser Arg Val Leu Ala His Met Glu Ala 
435 440 445 

Arg Gly Val Arg Leu Asp Val Pro Leu Leu Glu Ala Leu Ser Phe Glu 
450 455 460 

Leu Glu Lys Glu Met Glu Arg Leu Glu Gly Glu Val Phe Arg Leu Ala 
465 470 475 480 

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 
485 490 495 

Phe Asp Glu Leu Gly Leu Thr Pro Val Gly Arg Thr Gin Lys Thr Gly 
500 505 510 

Lys Arg Ser Thr Ala Gin Gly Ala Leu Glu Ala Leu Arg Gly Ala His 
515 520 525 

Pro lie Val Glu Leu lie Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys 
530 535 540 

Ser Thr Tyr Leu Asp Pro Leu Pro Arg Leu Val His Pro Arg Thr Gly 
545 550 555 560 

Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu 
565 570 575 

Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu 
580 585 590 
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Gly Gin Arg lie Arg Lys Ala Phe Val Ala Glu Glu Qly Trp Leu Leu 
595 600 605 

Leu Ala Ala Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu 
610 615 620 

Ser Gly Asp Glu Asn Leu Lys Arg Val Phe Arg Glu Gly Lys Asp He 
625 630 635 640 

His Thr Glu Thr Ala Ala Trp Met Phe Gly Leu Asp Pro Ala Leu Val 
645 650 655 

Asp Pro Lys Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu 
660 665 670 

Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Gly He Asp Tyr 

675 680 685 

Lys Glu Ala Glu Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys 
690 695 700 

Val Arg Ala Trp He Glu Arg Thr Leu Glu Glu Gly Arg Thr Arg Gly 
705 710 715 720 

Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Ala 
725 730 735 

Ser Arg Val Arg Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
740 745 750 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys He Ala Met Val 
755 760 765 

Lys Leu Phe Pro Arg Leu Lys Pro Leu Gly Ala His Leu Leu Leu Gin 
770 775 780 

Val His Asn Glu Leu Val Leu Glu' Val Pro Glu Asp Arg Ala Glu Glu 
•785 790 795 800 

Ala Lys Ala Leu Val Lys Glu Val Met Glu Asn Ala Tyr Pro Leu Asp 
805 810 815 

Val Pro Leu Glu Val Glu Val Gly Val Gly Arg Asp Trp Leu Glu Ala 
820 825 830 

Lys Gin Asp 
835 

<210> 342 

<211> 38 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
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<400> 342 

tggcggcggc ctcgggcggc cgcgtccacc gggcaaca 



<210> 343 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 343 

cttctctcat ccgccaaaac agcc 



<210> 344 

<211> 38 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 344 



38 



tggccgccgc 


ctggggcggc 


cgcgtttacc gggcggag 


<210> 345 








<211> 2505 








<212> DNA 








<213> Artificial 






<220> 








<223> Synthetic 






<400> 345 






gaacccccca agcgggtgct tctggtggac 


atgaattcca 


ccccactttt 


tgacctggag 


ggccaccacc 


tggcctaccg 


caccttctat 


gccctgagcc tcaccacctc ccggggggag 


ccggtgcaga 


tggtctacgg 


cttcgcccgg 


agcctcctca aggccttgaa ggaggacgga 


caggcggtgg 


tcgtggtctt 


tgacgccaag 


gccccctcct tccgccacga ggcctacgag 


gcctacaagg 


cgggccgggc 


ccccaccccg 


gaggacttcc cccgccagct cgccttggtc 


aagcggctgg 


tggaccttct 


gggcctggtc 


cgcctcgagg ccccggggta cgaggcggac 


gacgtcctgg 


gcaccctggc 


caagaaggcc 


gaaagggagg ggatggaggt gcgcatcctc 


acgggagacc 


gggacttctfc 


ccagctcctc 


tccgagaagg tctcggtcct cctgccggac 


gggaccctgg 


tcaccccaaa 


ggacgtccag 


gagaagtacg gggtgccccc ggagcgctgg 


gtggacttcc 


gcgccctcac 


gggggaccgc 


tcggacaaca tccccggggt ggcggggata 



24 



38 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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ggggagaaga 


ccgcccttcg 


actcctcgca 


gagtggggga 


gcgtggaaaa 


cctcctgaag 


660 


aacctggacc 


gggtaaagcc 


ggactcgctc 


cggcgcaaga 


tagaggcgca 


cctcgaggac 


720 


ctccacctct 


ccttagacct 


ggcccgcatc 


cgcaccgacc 


tccccctgga 


ggtggacttt 


780 


aaggccctgc 


gccgcaggac 


ccccgacctg 


gagggcctga 


gggccttttt 


ggaggagctg 


840 


gagttcggaa 


gcctcctcca 


cgagttcggc 


ctcctgggag 


gggagaagcc 


ccgggaggag 


900 


gccccctggc 


ccccgcccga 


aggggccttc 


gtgggcttcc 


tcctttcccg 


caaggagccc 


960 


atgtgggcgg 


agcttctggc 


cctggcggcg 


gcctcgggcg 


gccgcgtcca 


ccgggcaaca 


1020 


agcccggttg 


aggccctggc 


cgacctcaag 


gaggcccggg 


ggttcctggc 


caaggacctg 


1080 


gccgttttgg 


ccctgcggga 


gggggtggcc 


ctggacccca 


cggacgaccc 


cctcctggtg 


1140 


gcctacctcc 


tggacccggc 


caacacccac 


cccgaggggg 


tggcccggcg 


ctacgggggc 


1200 


gagttcacgg 


aggacgcagc 


ggagagggcc 


ctcctctccg 


agaggctctt 


ccagaacctc 


1260 


tttaaacggc 


tttccgagaa 


gctcctctgg 


ctctaccagg 


aggtggagcg 


gcccctctcc 


1320 


cgggtcttgg 


cccacatgga 


ggcccggggg 


gtgaggctgg 


acgtccccct 


tctggaggcc 


1380 


ctctcctttg 


agctggagaa 


ggagatggag 


cgcctggagg 


gggaggtctt 


ccgtttggcc 


1440 


ggccacccct 


tcaacctcaa 


ctcccgcgac 


cagctggaaa 


gggtcctctt 


tgacgagctg 


1500 


ggcctcaccc 


cggtgggccg 


gacgcagaag 


acgggcaagc 


gctccaccgc 


ccagggggcc 


1560 


ctggaggccc 


tccggggggc 


ccaccccatc 


gtggagctca 


tcctccagta 


ccgggagctt 


1620 


tccaagctca 


aaagcaccta 


cctggacccc 


ctgccccggc 


tcgtccaccc 


gcggacgggc 


1680 


cggctccaca 


cccgcttcaa 


ccagacggcc 


acggccacgg 


gaaggctttc 


cagctccgac 


1740 


cccaacctgc 


agaacatccc 


cgtgcgcacc 


cccttggggc 


agcgcatccg 


caaggccttc 


1800 


gtggccgagg 


aggggtggct 


ccttttggcg 


gcggactact 


cccagattga 


gctccgggtc 


1860 


ctggcccacc 


tctcggggga 


cgagaacctg 


aagcgggtct 


tccgggaggg 


gaaggacatc 


1920 


cataccgaga 


ccgccgcctg 


gatgttcggc 


ttagaccccg 


ctctggtgga 


tccaaagatg 


1980 


cgccgggcgg 


ccaagacggt 


caacttcggc 


gtcctctacg 


ggatgtccgc 


ccacaggctc 


2040 


W W X* W Ciy W« 




^ ^ ^ ci ciy y ciy 


y ^yyciyy**^ ^ 


t" t" a ^ f'afiaca 
u i«cii» L>yciy wy 






agcttcccca 


aggtgcgggc 


ctggatagaa 


aggaccctgg 


aggagggccg 


gacgcggggc 


2160 


tacgtggaga 


ccctgttcgg 


caggaggcgc 


tatgtgcccg 


acctggcctc 


ccgggtccgc 


2220 


tcggtgcggg 


aggcggcgga 


gcggatggcc 


ttcaacatgc 


ccgtgcaggg 


caccgccgcc 


2280 


gacctgatga 


agatcgccat 


ggtcaagctc 


ttccGcaggc 


taaagcccct 


gggggcccac 


2340 
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ctcctcctcc aagtgcacaa cgagctggtc ctggaggtgc ccgaggaccg ggccgaggag 2400 
gccaaggccc tggtcaagga ggtcatggag aacgcctacc ccctggacgt gcccctcgag 2460 
gtggaggtgg gcgtgggtcg ggactggctg gaggcgaagc aggat 2505 



<210> 346 

<211> 835 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 346 

Met Asn Ser Thr Pro Leu Phe Asp Leu Glu Glu Pro Pro Lys Arg Val 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Tyr Ala Leu 
20 25 30 

Ser Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Met Val Tyr Gly Phe 
35 40 45 

Ala Arg Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Gin Ala Val Val 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 
65 70 75 80 

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu Val Lys Arg Leu Val Asp Leu Leu Gly Leu Val Arg Leu 
100 105 110 

Glu Ala Pro Gly Tyr Glu Ala Asp Asp Val Leu Gly Thr Leu Ala Lys 
115 120 125 

Lys Ala Glu Arg Glu Gly Met Glu Val Arg lie Leu Thr Gly Asp Arg 
130 135 140 

Asp Phe Phe Gin Leu Leu Ser Glu Lys Val Ser Val Leu Leu Pro Asp 
145 150 155 160 

Gly Thr Leu Val Thr Pro Lys Asp Val Gin Glu Lys Tyr Gly Val Pro 
165 170 175 

Pro Glu Arg Trp Val Asp Phe Arg Ala Leu Thr Gly Asp Arg Ser Asp 
180 185 190 

Asn He Pro Gly Val Ala Gly He Gly Glu Lys Thr Ala Leu Arg Leu 
195 200 205 
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Leu Ala Glu Trp Gly Ser Val Glu Asn Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Val Lys Pro Asp Ser Leu Arg Arg Lys lie Glu Ala His Leu Glu Asp 
225 230 235 240 

Leu His Leu Ser Leu Asp Leu Ala Arg He Arg Thr Asp Leu Pro Leu. 

245 250 255 

Glu Val Asp Phe Lys Ala Leu Arg Arg Arg Thr Pro Asp Leu Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Glu Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu Ser Arg Lys Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Leu Ala Leu Ala Ala Ala Ser Gly Gly Arg Val 
325 330 335 

His Arg Ala Thr Ser Pro Val Glu Ala Leu Ala Asp Leu Lys Glu Ala 
340 345 350 

Arg Gly Phe Leu Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly 
355 360 365* 

Val Ala Leu Asp Pro Thr Asp Asp Pro Leu Leu Val Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ala Asn Thr His Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Phe Thr Glu Asp Ala Ala Glu Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

Phe Gin Asn Leu Phe Lys Arg Leu Ser Glu Lys Leu Leu Trp Leu Tyr 
420 425 430 

Gin Glu Val Glu Arg Pro Leu Ser Arg Val Leu Ala His Met Glu Ala 
435 440 445 

Arg Gly Val Arg Leu Asp Val Pro Leu Leu Glu Ala Leu Ser Phe Glu 
450 455 460 

Leu Glu Lys Glu Met Glu Arg Leu Glu Gly Glu Val Phe Arg Leu Ala 
465 470 475 480 

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 
485 490 495 



Phe Asp Glu Leu Gly Leu Thr Pro Val Gly Arg Thr Gin Lys Thr Gly 
500 505 510 
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hys Arg Ser Thr Ala Gin Gly Ala Leu Glu Ala Leu Arg Gly Ala His 
•515 520 525 

Pro He Val Glu Leu He Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys 
530 535 540 

Ser Thr Tyr Leu Asp Pro Leu Pro Arg Leu Val His Pro Arg Thr Gly 
545 550 555 560 

Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu 
565 570 575 

Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu 
580 585 590 

Gly Gin Arg He Arg Lys Ala Phe Val Ala Glu Glu Gly Trp Leu Leu 
595 600 605 

Leu Ala Ala Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu 
610 615 620 

Ser Gly Asp Glu Asn Leu Lys Arg Val Phe Arg Glu Gly Lys Asp He 
625 630 635 640 

His Thr Glu Thr Ala Ala Trp Met Phe Gly Leu Asp Pro Ala Leu Val 
645 650 655 

Asp Pro Lys Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu 
660 665 670 

Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Gly He Asp Tyr 
675 680 685 

Lys Glu Ala Glu Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys 
690 695 700 

Val Arg Ala Trp He Glu Arg Thr Leu Glu Glu Gly Arg Thr Arg Gly 
705 710 715 720 

Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Ala 
725 730 735 

Ser Arg Val Arg Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
740 745 - 750 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys He Ala Met Val 
755 760 765 

Lys Leu Phe Pro Arg Leu Lys Pro Leu Gly Ala His Leu Leu Leu Gin 
770 775 780 

Val His Asn Glu Leu Val Leu Glu Val Pro Glu Asp Arg Ala Glu Glu 
785 790 795 . 800 

Ala Lys Ala Leu Val Lys Glu Val Met Glu Asn Ala Tyr Pro Leu Asp 
805 810 815 
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Val Pro lieu Glu Val Glu Val Gly Val Gly Arg Asp Trp Leu Glu Ala 
820 825 830 

Lys Gin Asp 
835 

<210> 347 

<211> 2496 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 347 



atgaattccc 


tgcccctctt 


tgagcccaag 


ggccgggtgc 


ttctggtgga 


cggccaccac 


60 


ctggcctacc 


gtaccttttt 


tgccctgaag 


ggcctcacca 


ccagccgcgg 


gg age egg to 


120 


caggcggtgt 


acgggtttgc 


caagagcctt 


ttgaaggcgc 


taagggaaga 


cggggatgtg 


180 


gtgatcgtgg 


tgtttgacgc 


caaggccccc 


tccttccgcc 


accagaccta 


cgaggcctac 


240 


aaggcggggc 


gggctcccac 


ccccgaggac 


tttccccggc 


agcttgccct 


tatcaaggag 


300 


atggtggacc 


ttttgggcct 


ggagcgcctc 


gaggtgccgg 


gctttgaagc 


ggatgacgtc 


360 


ctggctaccc 


tggccaagaa 


ggcggaaaag 


gaaggctacg 


aagtgcgcat 


cctcaccgcg 


420 


gaccgggacc 


tttaccagct 


tctttcggag 


cgaatctcca 


tccttcaccc 


ggagggttac 


480 


ctgatcaccc 


cggagtggct 


ttgggagaag 


tatgggctta 


agccttccca 


gtgggtggac 


540 


taccgggcct 


tggccgggga 


cccttccgac 


aacatccccg 


gcgtgaaggg 


catcggggag 


600 


aagacggcgg 


ccaagctgat' 


ccgggagtgg 


ggaagcctgg 


aaaaccttct 


taagcacctg 


660 


gaacaggtga 


aacctgcctc 


cgtgcgggag 


aagatcctta 


gccacatgga 


ggacctcaag 


720 


ctatccctgg 


agctatcccg 


ggtgcacacg 


gacttgctcc 


ttcaggtgga 


cttcgcccgg 


780 


cgccgggagc 


cggaccggga 


ggggcttaag 


gcctttttgg 


agaggctgga 


gttcggaagc 


840 


ctcctccacg 


agttcggcct 


gttggaaagc 


ccggtggcgg 


cggaggaagc 


tccctggccg 


900 


ccccccgagg 


gagccttcgt 


ggggtacgtt 


ctttcccgcc 


ccgagcccat 


gtgggcggag 


960 


cttaacgcct 


tggccgccgc 


ctggggcggc 


cgcgtttacc 


gggcggagga 


tcccttggag 


1020 


gccttgcggg 


ggcttgggga 


ggtgaggggg 


cttttggcca 


aggacctggc 


ggtgctggcc 


1080 


ctgagggaag 


ggattgccct 


ggcaccgggc 


gacgacccca 


tgctcctcgc 


ctacctcctg 


1140 


gatccttcca 


acaccgcccc 


cgaaggggta 


gcccggcgct 


acggggggga 


gtggaccgag 


1200 


gaggcggggg 


aaagggcgct 


gctttccgaa 


aggctttacg 


ccgccctcct 


gaagcggctt 


1260 
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325 



330 



335 



Pro Glu Pro Tyr I*ys Ala lieu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu .Gly Leu Gly Leu 
355 ' 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Ala Thr Pro Leu Gly 



580 



585 



590 



Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 



Gly 



Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
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625 



630 



635 



640 



Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 

645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 



Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 



Arg Val Lys Ser Val Arg Glu Ala 
740 

Pro Val Gin Gly Thr Ala Ala Asp 
755 760 

Leu Phe Pro Arg Leu Glu Glu Net 
770 775 

Ala Asn Glu Leu Val Leu Glu Ala 
785 790 

Ala Arg Leu Ala Lys Glu Val Net 
805 

Pro Leu Glu Val Glu Val Gly He 

820 



Ala Glu Arg Met Ala Phe Asn Net 
745 750 

Leu Met Lys Leu Ala Met Val Lys 
765 

Gly Ala Arg Net Leu Leu Gin Val 
780 

Pro Lys Glu Arg Ala Glu Ala Val 
795 800 

Glu Gly Val Tyr Pro Leu Ala Val 
810 815 

Gly Glu Asp Trp Leu Ser Ala Lys 
825 830 



Glu His His His His His His 
835 



<210> 41 

<211> 839 

<212> PRT 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 41 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 
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Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 



249 



wo 01/90337 



PCT/USOl/17086 



Pro Glu Pro Tyr Lys Ala lieu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala lie Lys Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 



Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
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645 



650 



655 



pro Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Tirp Leu Ser Ala Lys 
820 825 830 



Glu His His His His His His 
835 



<210> 42 

<211> 839 

<212> PRT 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 42 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 
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Ala Iiys Ser Leu Leu Lys Ala Leu 
50 55 

Val Val Phe Asp Ala Lys Ala Pro 
65 70 

Gly Tyr Lys Ala Gly Arg Ala Pro 
85 

Leu Ala Leu lie Lys Glu Leu Val 
100 

Glu Val Pro Gly Tyr Glu Ala Asp 
115 120 



Lys Glu Asp Gly Asp Ala Val He 
60 

Ser Phe Arg His Glu Ala Tyr Glu 
75 80 

Thr Pro Glu Asp Phe Pro Arg Gin 
90 95 

Asp Leu Leu Gly Leu Ala Arg Leu 
105 110 

Asp Val Leu Ala Ser Leu Ala Lys 
125 



Lys Ala Glu Lys Glu Gly Tyr Glu 
130 135 

Asp Leu Tyr Gin Leu Leu Ser Asp 
145 150 

Gly Tyr Leu He Thr Pro Ala Trp 
165 

Pro Asp Gin Trp Ala Asp Tyr Arg 
180 



Val Arg He Leu Thr Ala Asp Lys 
140 

Arg He His Val Leu His Pro Glu 
155 160 

Leu Trp Glu Lys Tyr Gly Leu Arg 
170 175 

Ala Leu Thr Gly Asp Glu Ser Asp 
185 190 



Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 



Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 



Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 

260 265 270 



Phe Leu Glu Arg Leu Glu Phe Gly 

275 280 

Leu Glu Ser Pro Lys Ala Leu Glu 
290 295 

Gly Ala Phe Val Gly Phe Val Leu 
305 310 

Asp Leu Leu Ala Leu Ala Ala Ala 
325 

Pro Glu Pro Tyr Lys Ala Leu Arg 
340 



Ser Leu Leu His Glu Phe Gly Leu 

285 

Glu Ala Pro Trp Pro Pro Pro Glu 
300 

Ser Arg Lys Glu Pro Met Trp Ala 
315 320 

Arg Gly Gly Arg Val His Arg Ala 
330 335 

Asp Leu Lys Glu Ala Arg Gly Leu 
345 350 
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Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 



Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 
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Pro Leu Met Arg Arg Ala Ala Lys 
660 

Gly Met Ser Ala His Arg Leu Ser 
675 680 



Thr lie Asn Phe Gly Val Leu Tyr 
665 670 

Gin Glu Leu Ala lie Pro Tyr Glu 
685 



Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 



Glu His His His His His His 
835 



<210> 43 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 43 



Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 

20 25 30 



Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 



Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 
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Val val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Glu Thr Glu Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
ISO 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 • 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 

260 265 270 

Phe- Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 
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Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 

515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 



Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 
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Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 ^80 

Glu Ala Gin Ala Phe lie Qlu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arq Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Qlu Qlu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 44 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 44 

Met Asn ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
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65 



70 



75 



80 



Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 • 155 160 

Gly Tyr Leu He Thr Pro Ala Tip Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 



Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
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370 



375 



380 



Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Gly Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Tzrp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
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675 680 685 

Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 45 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 45 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 

15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 
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Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leii Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 ' 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 
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Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
4Q5 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Lys Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 

610 615 620 



Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 



625 



630 



635 



640 



Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 

660 665 670 



Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 
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Glu Ala Gin Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 46 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 46 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
1 5 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 
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Leu Ala lieu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 

245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 



Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 
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Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 

420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
. 435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Lys He Asn Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 * 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 
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Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 

725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Net Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 


47 


<211> 


839 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


47 


Met Asn Ser Gly Met 


1 


5 


Leu Val Asp Gly His 



10 15 



20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu TVla Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
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100 



105 110 



Glu Val Pro Qly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Qly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Qly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 

275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
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405 



410 



415 



Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg He Pro Lys He Lys Lys Thr His Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 



Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 



Ala Leu Asp Tyr Ser Gin He Glu 
610 615 

Gly Asp Glu Asn Leu He Arg Val 
625 630 

Thr Glu Thr Ala Ser Trp Met Phe 
645 



Leu Arg Val Leu Ala His Leu Ser 
620 

Phe Gin Glu Gly Arg Asp He His 
635 640 

Gly Val Pro Arg Glu Ala Val Asp 
650 655 



Pro Leu Met Arg Arg Ala Ala Lys 
660 

Gly Met Ser Ala His Arg Leu Ser 

675 680 

Glu Ala Gin Ala Phe He Glu Arg 

690 695 

Arg Ala Trp He Glu Lys Thr Leu 



Thr He Asn Phe Gly Val Leu Tyr 
665 670 

Gin Glu Leu Ala He Pro Tyr Glu 
685 

Tyr Phe Gin Ser Phe Pro Lys Val 
700 

Glu Glu Gly Arg Arg Arg Gly Tyr 
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705 710 



715 720 



Val 61u Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 48 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 48 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 
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Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 

275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 

340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 



Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 
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Leu Leu Lya Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Lys Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Ser Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp. Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 SS5 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 
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Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala^^sn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 615 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 49 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 49 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 

65 70 " 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 
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Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

ASP Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 ' 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 

370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 
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Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 ' 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Lys Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Leu Leu Glu Ala Leu Tlrg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Serj 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 



Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 
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Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 SIO 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 50 

<211> 842 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 50 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 

15 10 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 S5 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 ■ 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
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130 



135 



140 



Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 . 190 

Ser Asp Asn Leu Arg Gly Val Arg Gly lie Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 

370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 



Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
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435 



440 



445 



Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Qlu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Qlu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 



277 



wo 01/90337 



PCTAJSOl/17086 



740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 

755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu TVsp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 



<210> 


51 


<211> 


842 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


51 


Met Asn Ser Glu Ala 


1 


5 


Leu Leu Val Asp Gly 



10 15 



20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 
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ASP Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 

pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 



165 



170 



175 



Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 



ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Tyr Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 



210 



215 



220 



Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 



225 



230 



235 



Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 

315 320 



305 



310 



Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 



405 



410 



His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 
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Glu Ala Thr Gly Val Arg Arg Asp val Ala Tyx Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 

675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 



Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 
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Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 

840 





835 


<210> 


52 


<211> 


842 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


52 



Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro. Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 



281 



wo 01/90337 



PCTAJSOl/17086 



Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly lie Arg Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu -Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 . 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 
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Leu Glu Leu Ala Glu Glu He Arg Arg Leu Qlu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 
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Met Val 
770 


Lys 


Leu 


Phe 


Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
775 780 


Leu Gin 
785 


Val 


Ala 


Asn 


Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
790 795 800 


Glu Qlu 


Val 


Ala 


Ala 
805 


Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
810 815 


Leu Ala 


Val 


Pro 
820 


Leu 


Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
825 830 


Ser Ala 


Lys 
835 


Gly 


His 


His His His His His 
840 


<210> 53 
<211> 839 
<212> PRT 
<213> Artificial 




<220> 

<223> Synthetic 




<400> 53 








Met Asn 
1 


Ser 


Gly 


Met 
5 


Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
10 15 


Leu Val 


Asp 


Gly 
20 


His 


His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
25 30 


Gly Leu 


Thr 
35 


Thr 


Ser 


Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
40 45 


Ala Lys 
SO 


Ser 


Leu 


Leu 


Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
55 60 


Val Val 
65 


Phe 


Asp 


Ala 


Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
70 75 80 


Gly Tyr 


Lys 


Ala 


Gly 
85 


Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
90 95 


Leu Ala 


Leu 


He 
100 


Lys 


Qlu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
105 110 


Glu Val 


Pro 
115 


Gly 


Tyr 


wJ.U nXGL cXo^ exO^ VCIX JJC LA /IlXCI *9^JL JJCIX JJjr o 

120 125 


Lys Ala 
130 


Qlu 


Lys 


Glu 


Gly Tyr Qlu Val Arg lie Leu Thr Ala Asp Lys 
135 140 


Asp Leu 
145 


Tyr 


Gin 


Leu 


Leu Ser Asp Arg He His Val Leu His Pro Glu 
150 155 160 


Gly Tyr 


Leu 


He 


Thr 


Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
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165 



170 



175 



Pro Asp Gin Trp Ala Asp Tyr Arg Ala hen Thr Gly Asp Glu Ser Asp 
180 185 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 



260 



265 



270 



Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
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465 



470 



475 



480 



His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Lys Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Ser Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 



Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
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770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 54 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 54 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Qlv Tyr Lys Ala Gly Arg Ala Glu Thr Glu Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 
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Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Vxo Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 



Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 



288 



wo 01/90337 



PCTAJSOl/17086 



His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Pbe 
485 490 495 

Asp Glu Leu Arg Leu Pro Lys Leu Lys Lys Thr Lys Lys Tlir Gly Lys 
500 505 510 

Arg Ser Ser Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 

555 560 



545 



550 



Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His -Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arq Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 

715 720 



705 



710 



Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arq Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 
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Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 



<210> 


55 


<211> 


839 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


55 


Met Asn Ser Gly Met 


1 


5 


Leu Val Asp Gly His 




20 



10 15 



25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 

35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 
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Asn Leu Pro Gly Val Asn Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 



260 



265 



270 



Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 

475 480 



465 



470 



His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 
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Asp Glu Leu Arg Leu Pro Lys Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg fier Ser Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 

515 520 525 

lie Val Glu Lys lie Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 



Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 



Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 



Pro Leu Met Arg Arg Ala Ala Lys 
660 

Gly Met Ser Ala His Arg Leu Ser 
675 680 



Thr He Asn Phe Gly Val Leu Tyr 
665 670 

Gin Glu Leu Ala He Pro Tyr Glu 
685 



Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 

690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 
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Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro .Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 56 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 56 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
* 50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 

115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu' Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Gin Arg Lys Leu 
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195 



200 



205 



Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 

325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 " 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 



Asp Glu Leu Arg Leu Pro Lys Leu Lys Lys Thr Lys Lys Thr Gly Lys 
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500 



505 



510 



Arg Ser Ser Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
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80S 810 815 

Pro Leu Glu Val Glu Val Gly lie Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 57 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 57 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly Met Gly Glu Lys Thr Gly Arg Lys Leu 
195 200 205 
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Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Lys Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 
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Arg Ser Ser Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro T^g Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 

675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 



Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 
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Pro Leu Glu Val Glu Val Gly He Gly Qlu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 





835 


<210> 


58 


<211> 


839 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


58 



Met Asn Ser Gly Met Leu Pro Leu Phe Qlu Pro Lys Gly Arg val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala, Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Asn Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 
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Leu Xiys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Lys Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 



Arg Ser Ser Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 
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He Val Glu hys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Qly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 
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Glu His His His His His His 
835 

<210> 59 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 59 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Qly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 ' 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 

115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Pro Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
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225 



230 



235 



240 



Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 

290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 

450 455 460 

Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Lys Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg ser Ser Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 
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Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 



Glu His His His His His His 
835 
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<210> 60 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 60 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

■ Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
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245 



250 



255 



Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 

Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 . 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 

435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu Xle Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Lys Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Ser Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 



Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
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545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Ala Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 

725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu His His His His His His 
835 

<210> 61 
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<211> 839 
<212> PRT 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 61 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 

ISO 185 190 

Asn. Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 
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Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 ' 455 460 

Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 

515 520 525 

lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 
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Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys Asp lie His 
625 630 635 640 

Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val 
785 790 795 800 

Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Gly His His His His His His 
835 

<210> 62 
<211> 842 
<212> PRT 
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<213> Artificial 
<220> 

<223> Synthetic 
<400> 62 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
iOO 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 

210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 
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lieu Arg Ala Phe Leu Glu TUrg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly 
355 360 365 

Leu Gly Leu Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu 
405 410 415 

Phe Ala Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp 
420 425 430 

Leu Tyr Arg Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser 
450 455 460 

Leu Glu Val Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Gly Leu Pro Ala He Lys Lys Thr Gin Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Ser Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg 
545 550 555 560 



Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 
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Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

pro Leu Gly Gin Arg He Arg Arg Ala Phe He Ala Glu Qlu Gly Trp 
595 600 605 

Leu Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg 
625 630 635 640 

ASP He His Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Glu Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala 
785 790 795 SOO 

Glu Ala Val Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Glu His His His His His His 
835 840 

<210> 63<211> 835 

<212> PRT 

<213> Artificial 

<220> 
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<223> Synthetic 
<400> 63 

Met Glu Phe Thr Pro Leu Phe Asp Leu Glu Glu Pro Pro Lys Arg Val 
15 10 15 

Leu Leu Val Asp Oly His His Leu Ala Tyr Arg Thr Phe Tyr Ala Leu 
20 25 30 

Ser Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Met Val Tyr Gly Phe 
35 40 45 

Ala Arg Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Gin Ala Val Val 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 
65 70 75 80 

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu Val Lys Arg Leu Val Asp Leu Leu Gly Leu Val Arg Leu 
100 105 110 

Glu Ala Pro Gly Tyr Glu Ala Asp Asp Val Leu Gly Thr Leu Ala Lys 
115 120 125 

Lys Ala Glu Arg Glu Gly Met Glu Val Arg lie Leu Thr Gly Asp Arg 
130 135 140 

Asp Phe Phe Gin Leu Leu Ser Glu Lys Val Ser Val Leu Leu Pro Asp 
145 150 155 160 

Gly Thr Leu Val Thr Pro Lys Asp Val Gin Glu Lys Tyr Gly Val Pro 
165 170 175 

Pro Glu Arg Trp Val Asp Phe Arg Ala Leu Thr Gly Asp Arg Ser Asp 
180 185 190 

Asn lie Pro Gly Val Ala Gly lie Gly Glu Lys Thr Ala Leu Arg Leu 
195 200 205 

Leu Ala Glu Trp Gly Ser Val Glu Asn Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Val Lys Pro Asp Ser Leu Arg Arg Lys lie Glu Ala His Leu Glu Asp 
225 230 235 240 

Leu His Leu Ser Leu Asp Leu Ala Arg lie Arg Thr Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Phe Lys Ala Leu Arg Arg Arg Thr Pro Asp Leu Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Glu Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 
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Phe Gly Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu Ser Arg Lys Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Leu Ala Leu Ala Ala Ala Ser Glu Gly Arg Val 
325 330 335 

His Arg Ala Thr Ser Pro Val Glu Ala Leu Ala Asp Leu Lys Glu Ala 
340 345 350 

Arg Gly Phe Leu Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly 
355 360 365 

Val Ala Leu Asp Pro Thr Asp Asp Pro Leu Leu Val Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ala Asn Thr His Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Phe Thr Glu Asp Ala Ala Glu Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

Phe Gin Asn Leu Phe Lys Arg Leu Ser Glu Lys Leu Leu Trp Leu Tyr 
420 425 430 

Gin Glu Val Glu Arg Pro Leu Ser Arg Val Leu Ala His Met Glu Ala 
435 440 445 

Arg Gly Val Arg Leu Asp Val Pro Leu Leu Glu Ala Leu Ser Phe Glu 
450 455 460 

Leu Glu Lys Glu Met Glu Arg Leu Glu Gly Glu Val Phe Arg Leu Ala 
465 470 475 480 

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 
485 490 495 

Phe Asp Glu Leu Gly Leu Thr Pro Val Gly Arg Thr Gin Lys Thr Gly 
500 505 510 

Lys Arg Ser Thr Ala Gin Gly Ala Leu Glu Ala Leu Arg Gly Ala His 
515 520 525 

Pro lie Val Glu Leu lie Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys 
530 535 540 

Ser Thr Tyr Leu Asp Pro Leu Pro Arg Leu Val His Pro Arg Thr Gly 
545 550 555 560 

Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr- Ala Thr Gly Arg Leu 
565 570 575 

Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu 
580 585 590 
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Gly Gin Arg He Arg Lys Ala Phe Val Ala Glu Glu Gly Trp Leu Leu 
595 600 605 

Leu Ala Ala Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu 
610 615 620 

Ser Gly Asp Glu Asn Leu Lys Arg Val Phe Arg Glu Gly Lys Asp He 
625 630 635 640 

His Thr Glu Thr Ala Ala Trp Met Phe Gly Leu Asp Pro Ala Leu Val 
645 650 655 

Asp Pro Lys Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu 
660 665 670 

Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Gly He Asp Tyr 
675 680 685 

Lys Glu Ala Glu Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys 
690 695 700 

Val Arg Ala Trp He Glu Arg Thr Leu Glu Glu Gly Arg Thr Arg Gly 
705 710 715 720 

Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Ala 
725 730 735 

Ser Arg Val Arg Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 

740 745 750 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys He Ala Met Val 
755 760 765 

Lys Leu Phe Pro Arg Leu Lys Pro Leu Gly Ala His Leu Leu Leu Gin 
770 775 780 

Val His Asn Glu Leu Val Leu Glu Val Pro Glu Asp Arg Ala Glu Glu 
785 790 795 800 

Ala Lys Ala Leu Val Lys Glu Val Met Glu Asn Ala Tyr Pro Leu Asp 
805 810 815 

Val Pro Leu Glu Val Glu Val Gly Val Gly Arg Asp Trp Leu Glu Ala 
620 825 830 

Lys Gin Asp 





835 


<210> 


64 


<211> 


832 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 
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<400> 64 , 

Met Glu Phe Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
15 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
20 25 30 

Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val lie Val Val 
50 55 60 

Phe Asp Ala Lys Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu He Lys Glu Met Val Asp Leu Leu Gly Leu Glu Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Arg Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Glu Arg He Ser He Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu Lys Pro Ser 
165 170 175 

Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser Asp Asn He 
180 185 190 

Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Ala Lys Leu He Arg 
195 200 205 

Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys His Leu Glu Gin Val Lys 
210 215 220 

Pro Ala Ser Val Arg Glu Lys He Leu Ser His Met Glu Asp Leu Lys 
225 230 235 240 

Leu Ser Leu Glu Leu Ser Arg Val His Thr Asp Leu Leu Leu Gin Val 
245 250 255 

Asp Phe Ala Arg Arg Arg Glu Pro Asp Arg Glu Gly Leu Lys Ala Phe 
260 265 270 

Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu 

275 280 285 

Glu Ser Pro Val Ala Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 
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290 



295 



300 



Ala Phe Val Gly Tyr Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu 
305 310 315 320 

Leu Asn Ala Leu Ala Ala Ala Trp Glu Gly Arg Val Tyr Arg Ala Glu 
325 330 335 

Asp Pro Leu Glu Ala Leu Arg Gly Leu Gly Glu Val Arg Gly Leu Leu 
340 345 350 

Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly lie Ala Leu Ala 
355 360 365 

Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 
370 375 380 

Thr Ala Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu 
385 390 395 400 

Glu Ala Gly Glu Arg Ala Leu Leu Ser Glu Arg Leu Tyr Ala Ala Leu 
405 410 415 

Leu Lys Arg Leu Lys Gly Glu Glu Arg Leu Leu Trp Leu Tyr Glu Glu 
420 425 430 

Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr Gly 
435 440 445 

Val Arg Leu Asp Val Ala Tyr Leu Lys Ala Leu Ser Leu Glu Val Glu 
450 455 460 

Ala Glu lie Arg Arg Phe Glu Glu Glu Val His Arg Leu Ala Gly His 
465 470 475 480 

Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val lie Phe Asp 
485 490 495 

Glu Leu Gly Leu Pro Ala He Gly Lys Thr Gin Lys Thr Gly Lys Arg 
500 505 510 

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro He 
515 520 525 

Val Asp Arg He Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys Gly Thr 
530 535 540 

Tyr He Asp Pro Leu Pro Ala Leu Val His Pro Lys Thr Asn Arg Leu 
545 550 555 560 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 570 575 

Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin 
580 585 590 



Arg He Arg Arg Ala Phe Val Ala Glu Glu Gly Trp Arg Leu Val Val 
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595 600 605 

Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 615 620 

Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Gin Asp lie His Thr 
625 630 635 640 

Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp Ser 
645 650 655 

Leu Met Arg Arg Ala Ala Lys Thr lie Asn Phe Gly Val Leu Tyr Gly 
660 665 670 

Met Ser Ala His Arg Leu Ser Gly Glu Leu Ala lie Pro Tyr Glu Glu 
675 680 685 

Ala Val Ala Phe lie Qlu Arg Tyr Phe Gin Ser Tyr Pro Lys Val Arg 
690 695 700 

Ala Trp lie Glu Lys Thr Leu Ala Glu Gly Arg Glu Arg Gly Tyr Val 
705 710 715 720 

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Ala Ser Arg 
725 730 735 

Val Lys Ser He. Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 
740 745 750 

Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu 
755 760 765 

Phe Pro Arg Leu Gin Glu Leu Gly Ala Arg Met Leu Leu Gin Val His 

770 775 780 

Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Gin Ala Glu Glu Val Ala 
785 790 795 800 

Gin Glu Ala Lys Arg Thr Met Glu Glu Val Trp Pro Leu Lys Val Pro 
805 810 815 

Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys Ala 
820 825 830 

<210> 65 
<211> 842 
<212> PRT 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 65 

Met Asn Ser Thr Pro Leu Phe Asp Leu Glu Glu Pro Pro Lys Arg Val 
15 10 15 
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Leu lieu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Tyr Ala Leu 
20 25 30 

Ser Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Met Val Tyr Gly Phe 
35 40 45 

Ala Arg Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Gin Ala Val Val 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 
65 70 75 80 

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu Val Lys Arg Leu Val Asp Leu Leu Gly Leu Val Arg Leu 
100 105 110 

Glu Ala Pro Gly Tyr Glu Ala Asp Asp Val Leu Gly Thr Leu Ala Lys 
115 120 125 

Lys Ala Glu Arg Glu Gly Met > Glu Val Arg He Leu Thr Gly Asp Arg 
130 135 140 

Asp Phe Phe Gin Leu Leu Ser Glu Lys Val Ser Val Leu Leu Pro Asp 
145 150 155 160 

Gly Thr Leu Val Tlir Pro Lys Asp Val Gin Glu Lys Tyr Gly Val Pro 
165 170 175 

Pro Glu Arg Trp Val Asp Phe Arg Ala Leu Thr Gly Asp Arg Ser Asp 
180 185 190 

Asn He Pro Gly Val Ala Gly He Gly Glu Lys Thr Ala Leu Arg Leu 
195 200 205 

Leu Ala Glu Trp Gly Ser Val Glu Asn Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Val Lys Pro Asp Ser Leu Arg Arg Lys He Glu Ala His Leu Glu Asp 
225 230 235 240 

Leu His Leu Ser Leu Asp Leu Ala Arg He Arg Thr Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Phe Lys Ala Leu Arg Arg Arg Thr Pro Asp Leu Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Glu Leu Glu Phe Gly Ser Leu Leu His Glu 
275 260 285 

Phe Gly Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro 

290 295 300 



Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu Ser Arg Lys Glu Pro 
305 310 315 320 
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Met Trp Ala Glu Leu Leu Ala Leu Ala Ala Ala Ser Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 
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His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg. Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 

<210> 66 

<211> 838 

<212> PRT 

<2a3> Artificial 

<220> 

<223> Synthetic 
<400> 66 

Met Asn Ser Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
15 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
20 25 30 
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Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Qlu Asp Gly Asp Val Val He Val Val 
50 55 60 

Phe Asp Ala Lys Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu He Lys Glu Met Val Asp Leu Leu Gly Leu Glu Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Arg Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Glu Arg He Ser He Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu Lys Pro Ser 
165 170 175 

Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser Asp Asn He 
ISO 185 190 

Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Ala Lys Leu He Arg 
195 200 205 

Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys His Leu Glu Gin Val Lys 
210 215 220 

Pro Ala Ser Val Arg Glu Lys He Leu Ser His Met Glu Asp Leu Lys 
225 230 235 240 

Leu Ser Leu Glu Leu Ser Arg Val His Thr Asp Leu Leu Leu Gin Val 
245 250 255 

Asp Phe Ala Arg Arg Arg Glu Pro Asp Arg Glu Gly Leu Lys Ala Phe 
260 265 270 

Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu 

275 280 285 

Glu Ser Pro Val Ala Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 
290 295 300 

Ala Phe Val Gly Tyr Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu 
305 310 315 320 

Leu Asn Ala Leu Ala Ala Ala Trp Gly Gly Arg Val His Arg Ala Ala 
325 330 335 
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Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu Leu 
340 345 350 

Ala Lys Asp Leu Ala Val Leu Ala .Ser Arg Glu Gly Leu Asp Leu Val 
355 360 365 

Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 
370 375 380 

Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu 
385 390 395 400 

Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn Leu 
405 410 415 

Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His Glu 

420 425 430 

Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr Gly 
435 440 445 

Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu Ala 
450 455 460 

Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly His 
465 470 475 480 

Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp 
485 490 495 

Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys Arg 
500 505 510 

Ser Thr Ser Ala- Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro He 
515 520 525 

Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn Thr 
530 535 540 

Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg Leu 
545 550 555' 560 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 570 575 

Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin 
580 585 590 

Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val Ala 

595 600 605 

Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 615 620 



Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He His Thr 
625 630 635 640 
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Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp Pro 
645 650 655 

lieu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr Gly 
660 665 670 

Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu Glu 
675 680 685 

Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg 
690 695 700 

Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr Val 
705 710 715 720 

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala Arg 
725 730 735 

Val Lys Ser Val Arg Glu Ala Ala Qlu Arg Met Ala Phe Asn Met Pro 
740 745 750 

Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu 
755 760 765 

Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val Ala 
770 775 780 

Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val Ala 
785 790 795 800 

Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val Pro 
805 810 315 

Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys Gly 
820 825 830 

His His His His His His 
835 



<210> 67 

<211> 842 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 67 

Met Asn Ser Thr Pro Leu Phe Asp Leu Glu Glu Pro Pro Lys Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Tyr Ala Leu 
20 25 30 

Ser Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Met Val Tyr Gly Phe 
35 40 45 
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Ala Arg Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Gin Ala Val Val 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 
65 70 75 80 

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu Val Lys Arg Leu Val Asp Leu Leu Gly Leu Val Arg Leu 
100 105 110 

Glu Ala Pro Gly Tyr Glu Ala Asp Asp Val Leu Gly Thr Leu Ala Lys 
115 120 125 

Lys Ala Glu Arg Glu Gly Met Glu Val Arg He Leu Thr Gly Asp Arg 
130 135 140 

Asp Phe Phe Gin Leu Leu Ser Glu Lys Val Ser Val Leu Leu Pro Asp 
145 150 155 160 

Gly Thr Leu Val Thr Pro Lys Asp Val Gin Glu Lys Tyr Gly Val Pro 
165 170 175 

Pro GIU Arg Trp Val Asp Phe Arg Ala Leu Thr Gly Asp Arg Ser Asp 
180 185 190 

Asn He Pro Gly Val Ala Gly He Gly Glu Lys Thr Ala Leu Arg Leu 

195 200 205 

Leu Ala Glu Trp Gly Ser Val Glu Asn Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Val Lys Pro Asp Ser Leu Arg Arg Lys He Glu Ala His Leu Glu Asp 
225 230 235 240 

Leu His Leu Ser Leu Asp Leu Ala Arg He Arg Thr Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Phe Lys Ala Leu Arg Arg Arg Thr Pro Asp Leu Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Glu Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu Ser Arg Lys Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Leu Ala Leu Ala Ala Ala Ser Gly Gly Arg Val 
325 330 335 



His Arg Ala Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala 
340 345 350 
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Arg Gly Leu Leu Ala I*ys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly 
355 360 365 

Leu Gly Leu Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu 
405 410 415 

Phe Ala Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp 
420 425 430 

Leu Tyr Arg Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser 
450 455 460 

Leu Glu Val Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Gly Leu Pro Ala He Lys Lys Thr Gin Lys 
500 505 5X0 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Ser Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp 

595 600 605 

Leu Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg 
625 630 635 640 

Asp He His Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu 
645 650 655 
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Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Glu Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 

740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala 
785 790 795 800 

Glu Ala Val Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu 
820 825 830 



Ser TQa Lys Glu His His His His His His 
835 840 

<210> 68 

<211> 838 

<212> PRT 

<213> Artificial 



<220> 

<223> Synthetic 

<400> 68 



Met Asn Ser Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 

15 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 

20 25 30 

Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 

35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val He Val Val 
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50 55 60 

Phe Asp Ala Lys Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 BO 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu lie Lys Glu Met Val Asp Leu Leu Gly Leu Glu Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Arg Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Glu Arg lie Ser He Leu His Pro Glu Gly Tyr 
145 150 ' 155 160 

Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu Lys Pro Ser 

165 170 175 

Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser Asp Asn He 
180 185 190 

Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Ala Lys Leu He Arg 
195 200 205 

Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys His Leu Glu Gin Val Lys 
210 215 220 

Pro Ala Ser Val Arg Glu Lys He Leu Ser His Met Glu Asp Leu Lys 
225 230 235 240 

Leu Ser Leu Glu Leu Ser Arg Val His Thr Asp Leu Leu Leu Gin Val 
245 250 255 

Asp Phe Ala Arg Arg Arg Glu Pro Asp Arg Glu Gly Leu Lys Ala Phe 
260 265 270 

Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu 

275 280 285 

Glu Ser Pro Val Ala Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 
290 295 300 

Ala Phe Val Gly Tyr Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu 
305 310 315 320 

Leu Asn Ala Leu Ala Ala Ala Trp Gly Gly Arg Val His Arg Ala Pro 
325 330 335 

Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu Leu 
340 345 350 

Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu Pro 
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355 



360 



365 



Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 
370 375 380 

Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Qlu Trp Thr Glu 
385 390 395 400 

Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn Leu 
405 410 415 

Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg Glu 
420 425 430 

Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr Gly 
435 440 445 

Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val Ala 
450 455 460 

Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly His 
465 470 475 480 

Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp 
485 490 495 

Glu Leu Gly Leu Pro Ala He Lys Lys Thr Gin Lys Thr Gly Lys Arg 
500 505 510 

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro He 
515 520 525 

Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser Thr 
530 535 540 

Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg Leu 
545 550 555 560 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 570 575 

Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin 
580 585 590 

Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val Ala 

595 600 605 

Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 615 620 

Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His Thr 
625 630 635 640 

Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp Pro 
645 650 655 



Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr Gly 
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660 

Met Ser Ala His Arg Leu Ser Gin 
675 680 

Ala Gin Ala Phe He Glu Arg Tyr 
690 695 

Ala Trp He Glu Lys Thr Leu Glu 
705 710 

Glu Thr Leu Phe Gly Arg Arg Arg 

725 

Val Lys Ser Val Arg Glu Ala Ala 
740 

Val Gin Gly Thr Ala Ala Asp Leu 
755 760 

Phe Pro Arg Leu Glu Glu Met Gly 
770 775 

Asn Glu Leu Val Leu Glu Ala Pro 
785 790 

Arg Leu Ala Lys Glu Val Met Glu 
805 

Leu Glu Val Glu Val Gly He Gly 
820 

His His His His His His 

835 

<210> 69 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 69 



atgaattcgg ggatgctgcc 


cctctttgag 


cccaagggcc gggtcctcct ggtggacggc 


60 


caccacctgg cctaccgcac 


cttccacgcc 


ctgaagggcc tcaccaccag ccggggggag 


120 


ccggtgcagg cggtctacgg 


cttcgccaag 


agcctcctca aggccctcaa ggaggacggg 


180 


gacgcggtga tcgtggtctt 


tgacgccaag 


gccccctcct tccgccacga ggcctacggg 


240 


gggtacaagg cgggccgggc 


ccccacgccg 


gaggactttc cccggcaact cgccctcatc 


300 


aaggagctgg tggacctcct 


ggggctggcg 


cgcctcgagg tcccgggcta cgaggcggac 


360 


gacgtcctgg ccagcctggc 


caagaaggcg 


gaaaaggagg gctacgaggt ccgcatcctc 


420 
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665 



670 



Glu Leu Ala He Pro Tyr Glu Glu 
685 

Phe Gin Ser Phe Pro Lys Val Arg 
700 

Glu Gly Arg Arg Arg Gly Tyr Val 
715 720 

Tyr Val Pro Asp Leu Glu Ala Arg 
730 735 

Glu Arg Met Ala Phe Asn Met Pro 
745 750 

Met Lys Leu Ala Met Val Lys Leu 
765 

Ala Arg Met Leu Leu Gin Val Ala 
780 

Lys Glu Arg Ala Glu Ala Val Ala 
795 800 

Gly Val Tyr Pro Leu Ala Val Pro 
810 815 



Glu Asp Trp Leu Ser Ala Lys Glu 
825 830 
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accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


gcggggctaa 


aggacctcaa 


ggaggtccgg 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


gcctcgaggg 


a9Sr9gctaga 


cctcgtgccc 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


gaggacgccg 


cccaccgggc 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


ctggsgaagct 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


caccggggta 


cggcgggacg 


tggcctacct 


tcaggccctt 


1380 


tccctggagc 


ttgcggagga 


gatccgccgc 


ctcgaggagg 


aggtcttccg 


cttggcgggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tgctctttga 


cgagcttagg 


1500 


cttcccgcct 


tggggaagac 


gcaaaagaca 


ggcaagcgct 


ccaccagcgc 


cgcggtgctg 


1560 


gaggccctac 


gggaggccca 


ccccatcgtg 


gagaagatcc 


tccagcaccg 


ggagctcacc 


1620 


aagctcaaga 


acacctacgt 


ggaccccctc 


ccaagcctcg 


tccacccgag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggga 


ggcttagtag 


ctccgacccc 


1740 


aacctgcaga 


acatccccgt 


ccgcaccccc 


ttgggccaga 


ggatccgccg 


ggccttcgtg 


1800 


gccgaggcgg 


gttgggcgtt 


ggtggccctg 


gactatagcc 


agatagagct 


ccgcgtcctc 


1860 


gcccacctct 


ccggggacga 


aaacctgatc 


.agggtcttcc 


aggaggggaa 


ggacatccac 


1920 


acccagaccg 


caagctggat 


gttcggcgtc 


cccccggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agacggtgaa 


cttcggcgtc 


ctctacggca 


tgtccgccca 


taggctctcc 


2040 


caggagcttg 


ccatccccta 


cgaggaggcg 


gtggccttta 


tagagcgcta 


cttccaaagc 


2100 
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ttccccaagg tgcgggcctg gatagaaaag accctggagg aggggaggaa gcggggctac 2160 

gtggaaaccc tcttcggaag aaggcgctac gtgcccgacc tcaacgcccg ggtgaagagc 2220 

gtcagggagg ccgcggagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tcgccatggt gaagctcttc ccccgcctcc gggagatggg ggcccgcatg 2340 

ctcctccagg tccacaacga gctcctcctg gaggcccccc aagcgcgggc cgaggaggtg 2400 

gcggctttgg ccaaggaggc catggagaag gcctatcccc tcgccgtgcc cctggaggtg 2460 

gaggtgggga tgggggagga ctggctttcc gccaagggtc accaccacca ccaccac 2517 

<210> 70 
<2X1> 2526 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 70 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 

ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120 

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 

gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240 

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg* gcagctcgcc 300 

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360 

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 420 

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 480 

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 540 

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 600 

ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660 

ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 720 

gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 780 

gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 840 

gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 900 

gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960 

atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtcca ccgggccccc 1020 
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gagccttata 


aagccctcag ggacctgaag 


gaggcgcggg ggcttctcgc caaagacctg 


1080 


agcgttctgg 


ccctgaggga aggccttggc 


ctcccgcccg gcgacgaccc catgctcctc 


1140 


gcctacctcc 


tggacccttc gaacaccacc 


cccgaggggg tggcccggcg ctacggcggg 


1200 


gagtggacgg 


aggaggcggg ggagcgggcc 


gccctttccg agaggctctt cgccaacctg 


1260 


tgggggaggc 


ttgaggggga ggagaggctc 


ctttggcttt accgggaggt ggagaggccc 


1320 


ctttccgctg 


tcctggccca tatggaggcc 


^cgggggtgc gcctggacgt ggcctatctc 


1380 


agggccttgt 


ccctggaggt ggccgaggag 


atcgcccgcc tcgaggccga ggtcttccgc 


1440 


ctggccggcc 


accccttcaa cctcaactcc 


cgggaccagc tggaaagggt cctctttgac 


1500 


gagctagggc 


ttcccgccat cggcaagacg 


gagaagaccg gcaagcgctc caccagcgcc 


1560 


gccgtcctgg 


aggccctccg cgaggcccac 


cccatcgtgg agaagatcct gcagtaccgg 


1620 


gagctcacca 


agctgaagag cacctacatt 


gaccccttgc cggacctcat ccaccccagg 


1680 


acgggccgcc 


tccacacccg cttcaaccag 


acggccacgg ccacgggcag gctaagtagc 


1740 


tccgatccca 


acctccagaa catccccgtc 


cgcaccccgc ttgggcagag gatccgccgg 


1800 


gccttcatcg 


ccgaggaggg gtggctattg 


gtggccctgg actatagcca gatagagctc 


1860 


agggtgctgg 


cccacctctc cggcgacgag 


aacctgatcc gggtcttcca ggaggggcgg 


1920 


gacatccaca 


cggagaccgc cagctggatg 


ttcggcgtcc cccgggaggc cgtggacccc 


1980 


ctgatgcgcc 


gggcggccaa gaccatcaac 


ttcggggtcc tctacggcat gtcggcccac 


2040 


cgcctctccc 


aggagctagc catcccttac 


gaggaggccc aggccttcat tgagcgctac 


2100 


tttcagagct 


tccccaaggt gcgggcctgg 


attgagaaga ccctggagga gggcaggagg 


2160 


cgggggtacg 


tggagaccct cttcggccgc 


cgccgctacg tgccagacct agaggcccgg 


2220 


gtgaagagcg 


tgcgggaggc ggccgagcgc 


atggccttca acatgcccgt ccagggcacc 


2280 


gccgccgacc 


tcatgaagct ggctatggtg 


aagctcttcc ccaggctgga ggaaatgggg 


2340 


gccaggatgc 
gaggccgtgg 


tccttcaggt ccacaacgag 
cccggctggc caaggaggtc 


ctggtcctcg aggccccaaa agagagggcg 
atggaggggg tgtatcccct ggccgtgccc 


2400 
2460 


ctggaggtgg 


aggtggggat aggggaggac 


tggctctccg ccaaggagca ccaccaccac 


2520 


caccac 






2526 



<210> 71 

<211> 2517 

<212> DNA 

<213> Artificial 
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<220> 

<223> Synthetic 
<400> 71 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 

aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 360 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 

ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 720 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 

agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 

gatcttctgg ccctggccgc cgccaggggc ggccgcgtgc accgggcagc agaccccttg 1020 

gcggggctaa aggacctcaa ggaggtccgg ggcctcctcg ccaaggacct cgccgtcttg 1080 

gcctcgaggg aggggctaga cctcgtgccc ggggacgacc ccatgctcct cgcctacctc 1140 

ctggaccctt cgaacaccac ccccgagggg gtggcgcggc gctacggggg ggagtggacg 1200 

gaggacgccg cccaccgggc cctcctctcg gagaggctcc atcggaacct ccttaagcgc 1260 

ctcgaggggg aggagaagct cctttggctc taccacgagg tggaaaagcc cctctcccgg 1320 

gtcctggccc atatggaggc caccggggta cggcgggacg tggcctacct tcaggccctt 1380 

tccctggagc ttgcggagga gatccgccgc ctcgaggagg aggtcttccg cttggcgggc 1440 

caccccttca acctcaactc ccgggaccag ctggaaaggg tgctctttga cgagcttagg 1500 

cttcccgcct tggggaagac gcaaaagaca ggcaagcgct ccaccagcgc cgcggtgctg 1560 
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gaggccctac 


gggaggccca 


ccccatcgtg 


gagaagatcc 


tccagcaccg 


ggagctcacc 


1620 


aagctcaaga 


acacctacgt 


ggaccccctc 


ccaagcctcg 


tccacccgag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggga 


ggcttagtag 


ctccgacccc 


1740 


aacctgcaga 


acatccccgt 


ccgcaccccc 


ttgggccaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 






catoaccfcfcc 


aacafccrccca 




cere caccaa^c 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tccacaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 72 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 72 



atgaattcgg ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


srgtggacggc 


60 


caccacctgg cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 
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gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 

ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 720 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 

agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 

gatcttctgg ccctggccgc cgccaggggc ggccgcgtcc accgggcccc cgagccttat 1020 

aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 1080 

gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 1140 

ctggacGctt cgaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 1200 

gaggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gtgggggagg 1260 

cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cctttccgct 1320 

gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 1380 

tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 1440 

caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctaggg 1500 

cttcccgcca tcggcaagac ggagaagacc ggcaagcgct ccaccagcgc cgccgtcctg 1560 

gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 1620 

aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 1740 

aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcgtg 1800 

gccgaggcgg gttgggcgtt ggtggccctg gactatagcc agatagagct ccgcgtcctc 1860 

gcccacctct ccggggacga aaacctgatc agggtcttcc aggaggggaa ggacatccac 1920 

acccagaccg caagctggat gttcggcgtc cccccggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agacggtgaa cttcggcgtc ctctacggca tgtccgccca taggctctcc 2040 

caggagcttg ccatccccta cgaggaggcg gtggccttta tagagcgcta cttccaaagc 2100 

ttccccaagg tgcgggcctg gatagaaaag accctggagg aggggaggaa gcggggctac 2160 
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gtggaaaccc tcttcggaag 


aaggcgctac 


gtgcccgacc 


tcaacgcccg 


ggtgaagagc 


2220 




1.* CI L> y y \tf w L« t. C 






^y ^ wy w>y Cl 1^ 


2280 


ctcatgaagc tcgccatggt 


gaagctcttc 


ccccgcctcc 


gggagatggg 


ggcccgcatg 


2340 


ctcctccagg tccacaacga 


gctcctcctg 


gaggcccccc 


aagcgcgggc 


cgaggaggtg 


2400 


gcggctttgg ccaaggaggc 


catggagaag 


gcctatcccc 


tcgccgtgcc 


cctggaggtg 


2460 


gaggtgggga tgggggagga 


ctggctttcc 


gccaagggtc 


accaccacca 


ccaccac 


2517 



<210> 73 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 73 



atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


gcggggctaa 


aggacctcaa 


ggaggtccgg 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 



338 



wo 01/90337 



PCT/lISOl/17086 



gcctcgaggg aggggctaga cctcgtgccc ggggacgacc ccatgctcct cgcctacctc 1140 

ctggaccctt cgaacaccac ccccgagggg gtggcgcggc gctacggggg ggagtggacg 1200 

gaggacgccg cccaccgggc cctcctctcg gagaggctcc atcggaacct ccttaagcgc 1260 

ctcgaggggg aggagaagct cctttggctc taccacgagg tggaaaagcc cctctcccgg 1320 

gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 1380 

tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 1440 

caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctaggg 1500 

cttcccgcca tcggcaagac ggagaagacc ggcaagcgct ccaccagcgc cgccgtcctg 1560 

gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 1620 

aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 1740 

aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 1800 

gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 

gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 1920 

acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2040 

caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 

gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 
ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 
gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 
gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 



<210> 74 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
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<400> 74 



atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gtgggggagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 






caccggggca 


cggcgggacg 


tggcctacct 


tcaggccctt 


T on 

- ± JOU 


tccctggagc 


ttgcggagga 


gatccgccgc 


ctcgaggagg 


aggtcttccg 


cttggcgggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tgctctttga 


cgagcttagg 


1500 


cttcccgcct 


tggggaagac 


gcaaaagaca 


ggcaagcgct 


ccaccagcgc 


cgcggtgctg 


1560 


gaggccctac 


gggaggccca 


ccccatcgtg 


gagaagatcc 


tccagcaccg 


ggagctcacc 


1620 


aagctcaaga 


acacctacgt 


ggaccccctc 


ccaagcctcg 


tccacccgag 


gacgggccgc 


1680 
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ctccacaccc gcttcaacca gacggccacg gccacgggga ggcttagtag ctccgacccc 1740 

aacctgcaga acatccccgt ccgcaccccc ttgggccaga ggatccgccg ggccttcatc 1800 

gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 

gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 1920 

acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2040 

caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 

gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 

ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg .2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 

gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 75 
<211> 2517 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 75 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 

aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 360 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 
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gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


gaggacgccg 


cccaccgggc 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


caccggggta 


cggcgggacg 


tggcctacct 


tcaggccctt 


1380 


tccctggagc 


ttgcggagga 


gatccgccgc 


ctcgaggagg 


aggtcttccg 


cttggcgggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tgctctttga 


cgagcttagg 


1500 


cttcccgcct 


tggggaagac 


gcaaaagaca 


ggcaagcgct 


ccaccagcgc 


cgcggtgctg 


1560 


gaggccctac 


gggaggccca 


ccccatcgtg 


gagaagatcc 


tccagcaccg 


ggagctcacc 


1620 


aagctcaaga 


acacctacgt 


ggaccccctc 


ccaagcctcg 


tccacccgag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggga 


ggcttagtag 


ctccgacccc 


1740 


aacctgcaga 


acatccccgt 


ccgcaccccc 


ttgggccaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


99tggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 




ay ci w iw oL u C cL a 


cu ucgggguc 


^ ^ ^ ^ a ^t'Kftf* s 

ccc uacggca 


cgucggccca 


^ ^%^w ^ ^ ^ ^ 4* ^ ^ 

ccgccucucc 


£U4 u 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggcct'tc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 
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ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 

ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 

gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 76 
<211> 2517 
<212> mh 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 76 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 

aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 360 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 

ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 720 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 

aaaaggcggg- agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 

agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 

gatcttctgg ccctggccgc cgccaggggc ggccgcgtgc accgggcagc agaccccttg 1020 

gcggggctaa aggacctcaa ggaggtccgg ggcctcctcg ccaaggacct cgccgtcttg 1080 

gcctcgaggg aggggctaga cctcgtgccc ggggacgacc ccatgctcct cgcctacctc 1140 
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ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gtgggggagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


ggagaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acgcfagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 




yy wi-^y ay 
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ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tccacaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 77 

<211> 2526 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 77 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 
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ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120 

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 

gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240 

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 300 

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360 

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 420 

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 480 

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 540 

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 600 

ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660 

ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 720 

gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 780 

gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 840 

gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 900 

gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960 

atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtgca ccgggcagca 1020 

gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080 

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140 

gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 1200 

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 1260 

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 1320 

ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 1380 

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500 

gagcttaggc ttcccgcctt ggggaagacg caaaagacag gcaagcgctc caccagcgcc 1560 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740 
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tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


tgggccagag 

999 9 9 


gatccgccgg 


1800 


gccttcatcg 


ccgaggaggg 


gtggctattg 


crtqQCCctgq 


actatagcca 


gatagagctc 


1860 


agggtgctgg 


cccacctctc 


cggcaacqao 

9 9 ■*» 3 ** g ** 3 


aaccbga^cc 


gggtcttcca 


9 9**J939*'99 


1920 


gacatccaca 


cggagaccgc 


cagctqqatq 


ttcggcgtcc 


cccqqqagqc 


cgtggacccc 


1980 


ctgatgcgcc 


gggcggccaa 


gaccatcaac 




tctacggcat 


gtcggcccac 


2040 


cgcctctccc 


aggagctagc 


catcccttac 


gaQQaQgccc 

9 9 9 9 9 


aggccttcat 


tgagcgctac 


2100 


tttcagagct 


tccccaaggt 


gcgggcctgg 


attgagaaga 


ccctggagga 

99 9 9 


ggqcaggagg 

99 9 99 99 


2160 


cgggggtacg 


tggagaccct 


cttcggccgc 


cgccgctacg 


tgccagacct 


agagqcccqq 


2220 


gtgaagagcg 


tgcgggaggc 


ggccgagcgc 


atggccttca 


acatgcccgt 


ccagggcacc 


2280 


gccgccgacc 


tcatgaagct 


ggctatggtg 


aagctcttcc 


ccaggctgga 


ggaaatgggg 


2340 


gccaggatgc 


tccttcaggt 


ccacaacgag 


ctggtcctcg 


aggccccaaa 


agagagggcg 


2400 


gaggccgtgg 


cccggctggc 


caaggaggtc 


atggaggggg 


tgtatcccct 


ggccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


aggggaggac 


tggctctccg 


ccaaggagca 


ccaccaccac 


2520 


caccac 












2526 



<210> 78 

<211> 2526 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 78 



atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


600 
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ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660 

ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 720 

gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 780 

gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 840 

gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 900 

gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960 

atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtcca ccgggccccc 1020 

gagccttata aagccctcag ggacctgaag gaggcgcggg ggcttctcgc caaagacctg 1080 

agcgttctgg ccctgaggga aggccttggc ctcccgcccg gcgacgaccc catgctcctc 1140 

gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcccggcg ctacggcggg 1200 

gagtggacgg aggaggcggg ggagcgggcc gccctttccg agaggctctt cgccaacctg 1260 

tgggggaggc ttgaggggga ggagaggctc ctttggcttt accgggaggt ggagaggccc 1320 

ctttccgctg tcctggccca tatggaggcc acgggggtgc gcctggacgt ggcctatctc 1380 

agggccttgt ccctggaggt ggccgaggag atcgcccgcc tcgaggccga ggtcttccgc 1440 

ctggccggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt cctctttgac 1500 

gagctagggc ttcccgccat cggcaagacg gagaagaccg gcaagcgctc caccagcgcc 1560 

gccgtcctgg aggccctccg cgaggcccac cccatcgtgg agaagatcct gcagtaccgg 1620 

gagctcacca agctgaagag cacctacatt gaccccttgc cggacctcat ccaccccagg 1680 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacgggcag gctaagtagc 1740 

tccgatccca acctccagaa catccccgtc cgcaccccgc ttgggcagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280 
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gccgccgacc 


tcatgaagct 


cgccatggtg 


gcccgcatgc tcctccaggt 


ccacaacgag 


gaggaggtgg cggctttggc 


caaggaggcc 


ctggaggtgg aggtggggat 


gggggaggac 


caccac 






<210> 79 
<211> 2499 
<212> DNA 
<213> Artificial 




<220> 

<223> Synthetic 




<400> 79 
atgaattcgg 


ggatgctgcc 


cctctttgag 


caccacctgg 


cctaccgcac 


cttccacgcc 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


gacgcggtga 


ccgcggtctt 


tgacgccaag 


gggtacaagg 


cgggccgggc 


ccccacgccg 


aaggagctgg 


tggacctcct 


ggggctggcg 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


accgccgaca 


aagaccttta 


ccagctcctt 


gggtacctca 


tcaccccggc 


ctggctttgg 


gccgactacc 


gggccctgac 


cggggacgag 


ggggagaaga 


cggcgaggaa 


gcttctggag 


aacctggacc 


ggctgaagcc 


cgccatccgg 


aagctctcct 


gggacctggc 


caaggtgcgc 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agcctcctcc 


acgagttcgg 


ccccccggaa 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gatcttctgg 


ccctggccgc 


cgccaggggc 


gcggggctaa 


aggacctcaa 


ggaggtccgg 


gcctcgaggg 


aggggctaga 


cctcgtgccc 
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aagctcttcc 


cccgcctccg 


ggagatgggg 


2340 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


2400 


atggagaagg 


cctatcccct 


cgccgtgccc 


2460 


tggctttccg 


ccaagggtca 


ccaccaccac 


2520 
2526 



cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 
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ctggaccctt cgaacaccac ccccgagggg gtggcgcggc gctacggggg ggagtggacg 1200 

gaggacgccg cccaccgggc cctcctctcg gagaggctcc atcggaacct ccttaagcgc 1260 

ctcgaggggg aggagaagct cctttggctc taccacgagg tggaaaagcc cctctcccgg 1320 

gtcctggccc atatggaggc caccggggta cggcgggacg tggcctacct tcaggccctt 1380 

tccctggagc ttgcggagga gatccgccgc ctcgaggagg aggtcttccg cttggcgggc 1440 

caccccttca acctcaactc ccgggaccag ctggaaaggg tgctctttga cgagcttagg 1500 

cttcccgcct tggggaagac gcaaaagaca ggcaagcgct ccaccagcgc cgcggtgctg 1560 

gaggccctac gggaggccca ccccatcgtg gagaagatcc tccagcaccg ggagctcacc 1620 

aagctcaaga acacctacgt ggaccccctc ccaagcctcg tccacccgag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggga ggcttagtag ctccgacccc 1740 

aacctgcaga acatccccgt ccgcaccccc ttgggccaga ggatccgccg ggccttcgtg 1800 

gccgaggcgg gttgggcgtt ggtggccctg gactatagcc agatagagct ccgcgtcctc 1860 

gcccacctct ccggggacga aaacctgatc agggtcttcc aggaggggaa ggacatccac 1920 

acccagaccg caagctggat gttcggcgtc cccccggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agacggtgaa cttcggcgtc ctctacggca tgtccgccca taggctctcc 2040 

caggagcttg ccatccccta cgaggaggcg gtggccttta tagagcgcta cttccaaagc 2100 

ttccccaagg tgcgggcctg gatagaaaag accctggagg aggggaggaa gcggggctac 2160 

gtggaaaccc tcttcggaag aaggcgctac gtgcccgacc tcaacgcccg ggtgaagagc 2220 

gtcagggagg ccgcggagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tcgccatggt gaagctcttc ccccgcctcc gggagatggg ggcccgcatg 2340 

ctcctccagg tecacgacga gctcctcctg gaggcccccc aagcgcgggc cgaggaggtg 2400 

gcggctttgg ccaaggaggc catggagaag gcctatcccc tcgccgtgcc cctggaggtg 2460 

gaggtgggga tgggggagga ctggctttcc gccaagggt 2499 



<210> 80 

<211> 2499 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 80 
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atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


gcggggctaa 


aggacctcaa 


ggaggtccgg 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


gcctcgaggg 


aggggctaga 


cctcgtgccc 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


gaggacgccg 


cccaccgggc 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


caccggggta 


cggcgggacg 


tggcctacct 


tcaggccctt 


1380 


tccctggagc 


ttgcggagga 


gatccgccgc 


ctcgaggagg 


aggtcttccg 


cttggcgggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tgctctttga 


cgagcttagg 


1500 


cttcccgcct 


tsrgggaagac 


gcaaaagaca 


ggcaagcgct 


ccaccagcgc 


cgcggtgctg 


1560 


gaggccctac 


gggaggccca 


ccccatcgtg 


gagaagatcc 


tccagcaccg 


ggagctcacc 


1620 


aagctcaaga 


acacctacgt 


ggaccccctc 


ccaagcctcg 


tccacccgag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggga 


ggcttagtag 


ctccgacccc 


1740 
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aacctgcaga acatccccgt ccgcaccccc ttgggccaga ggatccgccg ggccttcatc 1800 

gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 

gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 1920 

acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2040 

caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 

gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 

ctccttcagg tccacgacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 

gaggtgggga taggggagga ctggctctcc gccaaggag . 2499 

<210> 81 
<211> 2499 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 81 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 

aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 360 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 
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ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gtgggggagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


ggagaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcgtg 


1800 


gccgaggcgg 


gttgggcgtt 


ggtggccctg 


gactatagcc 


agatagagct 


ccgcgtcctc 


1860 


gcccacctct 


ccggggacga 


aaacctgatc 


agggtcttcc 


aggaggggaa 


ggacatccac 


1920 


acccagaccg 


caagctggat 


gttcggcgtc 


cccccggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agacggtgaa 


cttcggcgtc 


ctctacggca 


tgtccgccca 


taggctctcc 


2040 


caggayctug 




cgaggaggcg 


gcggccuuua 


cagagcgcca 


cucccaaagc 


^ xuu 


ttccccaagg 


tgcgggcctg 


gatagaaaag 


accctggagg 


aggggaggaa 


gcggggctac 


2160 


gtggaaaccc 


tcttcggaag 


aaggcgctac 


gtgcccgacc 


tcaacgcccg 


ggtgaagagc 


2220 


gtcagggagg 


ccgcggagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tcgccatggt 


gaagctcttc 


ccccgcctcc 


gggagatggg 


ggcccgcatg 


2340 
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ctcctccagg tccacgacga gctcctcctg gaggcccccc aagcgcgggc cgaggaggtg 2400 

gcggctttgg ccaaggaggc catggagaag gcctatcccc tcgccgtgcc cctggaggtg 2460 

gaggtgggga tgggggagga ctggctttcc gccaagggt 2499 

<210> 82 
<211> 2499 
<212> DKA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 82 

atgaattcgg ggatgctgcc octet ttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 

aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 360 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 

ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 720 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 

agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 

gatcttctgg ccctggccgc cgccaggggc ggccgcgtcc accgggcccc cgagccttat 1020 

aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 1080 

gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 1140 

ctggaccctt cgaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 1200 
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gaggaggcgg gggagcgggc 


cgccctttcc 


gagaggctct tcgccaacct gcttaagagg 


1260 


cttgaggggg aggagaggct 


cctttggctt 


taccgggagg tggagaggcc cctttccgct 


1320 


gtcctggccc atatggaggc 


cacgggggtg 


cgcctggacg tggcctatct cagggccttg 


1380 


tccctggagg tggccgagga 


gatcgcccgc 


ctcgaggccg aggtcttccg cctggccggc 


1440 


caccccttca acctcaactc 


ccgggaccag 


ctggaaaggg tcctctttga cgagctaggg 


1500 


cttcccgcca tcggcaagac 


gcaaaagacc 


ggcaagcgct ccaccagcgc cgccgtcctg 


1560 


gaggccctcc gcgaggccca 


ccccatcgtg 


gagaagatcc tgcagtaccg ggagctcacc 


1620 


aagctgaaga gcacctacat 


tgaccccttg 


ccggacctca tccaccccag gacgggccgc 


1680 


ctccacaccc gcttcaacca 


gacggccacg 


gccacgggca ggctaagtag ctccgatccc 


1740 


aacctccaga acatccccgt 


ccgcaccccg 


cttgggcaga ggatccgccg ggccttcatc 


1800 


gccgaggagg ggtggctatt 


ggtggccctg 


gactatagcc agatagagct cagggtgctg 


1860 


gcccacctct ccggcgacga 


gaacctgatc 


cgggtcttcc aggaggggcg ggacatccac 


1920 


acggagaccg ccagctggat 


gttcggcgtc 


ccccgggagg ccgtggaccc cctgatgcgc 


1980 


cgggcggcca agaccatcaa 


cttcggggtc 


ctctacggca tgtcggccca ccgcctctcc 


2040 


caggagctag ccatccctta 


cgaggaggcc 


caggccttca ttgagcgcta ctttcagagc 


2100 


ttccccaagg tgcgggcctg 


gattgagaag 


accctggagg agggcaggag gcgggggtac 


2160 


gtggagaccc tcttcggccg 


ccgccgctac 


gtgccagacc tagaggcccg ggtgaagagc 


2220 


gugcgggagg cggccgagcg 


catggcct^tc 


aacatgcccg tccagggcac cgccgccgac 


2280 


ctcatgaagc tggctatggt 


gaagctcttc 


cccaggctgg aggaaatggg ggccaggatg 


2340 


ctccttcagg tccacgacga 


gctggtcctc 


gaggccccaa aagagagggc ggaggccgtg 


2400 


gcccggctgg ccaaggaggt 


catggagggg 


gtgtatcccc tggccgtgcc cctggaggtg 


2460 


gaggtgggga taggggagga 


ctggctctcc 


gccaaggag 


2499 



<210> 83 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 83 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 
caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 
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ccQPtcrcaQQ 


cqatctacaq 

»-yy i-w wwwgg 


cttcgccaag agcctcctca 


aggccctcaa ggaggacggg 


180 




tcgtggtctt 


tgacgccaag gccccctcct 


tccgccacga ggcctacggg 


240 


aaqtacaaQQ 


cqqqccqqqc 


ccccacgccg gaggactttc 


cccggcaact cgccctcatc 


300 


aaqqacjctQQ 


tggacctcct 


ggggctggcg cgcctcgagg 


tcccgggcta cgaggcggac 


360 


oacQtcctQQ 


ccagcctggc 


caagaaggcg gaaaaggagg 


gctacgaggt ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt tccgaccgca 


tccacgtcct ccaccccgag 


480 


craQtacctca 


tcaccccggc 


ctggctttgg gaaaagtacg 


gcctgaggcc cgaccagtgg 


540 


sccgactacc 


qqqccctqac 


^9999^^9^9 tccgacaacc 


ttcccggggt caagggcatc 


600 


craaaRaaa.aa 


wyy ^y y 


qctfcctaqaq qaqtqqqqqa 


gcctggaagc cctcctcaag 


660 


aacctiaaacc 

tic* WW w W 


qqctoaaqcc 


cgccatccgg gagaagatcc 


tggcccacat ggacgatctg 


720 


a.aac tc tec ti 


qqqacctqac 

yyy wv*w ^-yyw 


caaqqtqcqc accqacctqc 


ccctggaggt ggacttcgcc 


780 


aaaacTQCQCTQ 


agcccgaccg 


ggagaggctt agggcctttc 


tggagaggct tgagtttggc 


840 


agcctcctcc 


acqaqttcqg 


ccttctggaa agccccaagg 


ccctggagga ggcqccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt gtgctttccc 


gcaaggagcc catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc ggccgcgtcc 


accgggcccc cgagccttat 


1020 


aaagccctca 


gggacctgaa 


qqagqcqcgq qqqcttctcg 


ccaaagacct gagcgttctg 


1080 


qccctcraQcrQ 


aaqqccttqq 


cctcccgccc ggcgacgacc 


ccatgctcct cgcctacctc 


1140 


c t aaac c c fc. h 

w^^nC*w w w w 


cgaacaccac 


ccccqaqqqq qtqqcccqqc 
^^^^^y^yyyy y*-yy*-^^3y^ 


qctacqqcqg gqagtggacg 


1200 


yeiyyayy(^yy 


yyw^wwyyyw 


cacccfcfcfccc oaaaqqctct 


tcqccaacct qtqqqqgagg 


1260 


^uuyayyyyy 


y y 55 ^y y w \m 


cctfctoactit fc.accqaaaaa 


fcqqaqaaqc c c c fc 1 1 ccgc t 


1320 


y ^y y wv.^ 


a fc a fccrcr acrcrc 

aii>awyy ayy w 


wciw^*y yy y **y y ^y y y ^^y 


tggcctacct tcaggccctt 


1380 


w w WW w-y y c^y w 


fcfcaccrcraqcra 

i> uy wyy ayy a 


cra.fc cccrccqc c t cqacrqaqq 

j5 ww wwyw wjjw w w w jvt^^^*fcU5j 


aggtcttccg cttggcgggc 


1440 






^<^cicici^cci^ci cfcaaaaaoqq 

wwyyy av-VrfO-y w t.y y a.a-aiyyy 


tgctctttga cgagcttagg 


1500 


cttcccgcct 


tggggaagac 


gcaaaagaca ggcaagcgct 


ccaccagcgc cgcggtgctg 


1560 


gaggccctac 


gggaggccca 


ccccatcgtg gagaagatcc 


tccagcaccg ggagctcacc 


1620 


aagctcaaga 


acacctacgt 


ggaccccctc ccaagcctcg 


tccacccgag gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg gccacgggga 


ggcttagtag ctccgacccc 


1740 


aacctgcaga 


acatccccgt 


ccgcaccccc ttgggccaga 


ggatccgccg ggccttcatc 


1800 
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gccgaggagg ggtggctatt 


ggtggccctg 


gcccacctct ccggcgacga 


gaacctgatc 


acggagaccg ccagctggat 


gttcggcgtc 


cgggcggcca agaccatcaa 


cttcggggtc 


caggagctag 


ccatccctta 


cgaggaggcc 


ttccccaagg tgcgggcctg 


gattgagaag 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgcgggagg cggccgagcg 


catggccttc 


ctcatgaagc tggctatggt 


gaagctcttc 


ctccttcagg tccacaacga 


gctggtcctc 


gcccggctgg 


ccaaggaggt 


catggagggg 


gaggtgggga 


taggggagga 


ctggctctcc 


<210> 84 
<211> 2517 
<212> DNA 
<213> Artificial 




<220> 

<223> Synthetic 




<400> 84 
atgaattcgg 


ggatgctgcc 


cctctttgag 


caccacctgg 


cctaccgcac 


cttccacgcc 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gggtacaagg 


cgggccgggc 


ccccacgccg 


aaggagctgg 


tggacctcct 


ggggctggcg 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


accgccgaca 


aagaccttta 


ccagctcctt 


gggtacctca 


tcaccccggc 


ctggctttgg 


gccgactacc 


gggccctgac 


cggggacgag 


agggagaaga 


cggcgaggaa 


gcttctggag 


aacctggacc 


ggctgaagcc 


cgccatccgg 



PCT/USOl/17086 



gactatagcc 


agatagagct 


cagggtgctg 


1860 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


aacacgcccg 


t ccagggcac 


cgccgccgac 




cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gccaaggagc 


accaccacca 


ccaccac 


2517 



cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 
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aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 

agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 

gatcttctgg ccctggccgc cgccaggggc ggccgcgtcc accgggcccc cgagccttat 1020 

aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 1080 

gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 1140 

ctggaccctt cgaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 1200 

gaggaggcgg gggagcgggc cgccctttcc gagaggctcc atcggaacct gtgggggagg 1260 

cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cctttccgct 1320 

gtcctggccc atatggaggc caccggggta cggcgggacg tggcctacct tcaggccctt 1380 

tccctggagc ttgcggagga gatccgccgc ctcgaggagg aggtcttccg cttggcgggc 1440 

caccccttca acctcaactc ccgggaccag ctggaaaggg tgctctttga cgagcttagg 1500 

cttcccgcct tggggaagac gcaaaagaca ggcaagcgct ccaccagcgc cgcggtgctg 1560 

gaggccctac gggaggccca ccccatcgtg gagaagatcc tccagcaccg ggagctcacc 1620 

aagctcaaga acacctacgt ggaccccctc ccaagcctcg tccacccgag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggga ggcttagtag ctccgacccc 1740 

aacctgcaga acatccccgt ccgcaccccc ttgggccaga ggatccgccg ggccttcatc 1800 

gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 

gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 1920 

acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2040 

caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 

gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 

ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 
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gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 
gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 85 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 85 



atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcacaccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


c C f\ 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gcttaagagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 
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gtcctggccc atatggaggc caccggggta cggcgggacg tggcctacct tcaggccctt 1380 

tccctggagc ttgcggagga gatccgccgc ctcgaggagg aggtcttccg cttggcgggc 1440 

caccccttca acctcaactc ccgggaccag ctggaaaggg tgctctttga cgagcttagg 1500 

cttcccgcct tggggaagac gcaaaagaca ggcaagcgct ccaccagcgc cgcggtgctg 1560 

gaggccctac gggaggccca ccccatcgtg gagaagatcc tccagcaccg ggagctcacc 1620 

aagctcaaga acacctacgt ggaccccctc ccaagcctcg tccacccgag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggga ggcttagtag ctccgacccc 1740 

aacctgcaga acatccccgt ccgcaccccc ttgggccaga ggatccgccg ggccttcatc 1800 

gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 

gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 1920 

acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2040 

caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 

gtgigagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 

ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 

gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 



<2io> 86 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 86 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 
caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 
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gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


9999agaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


99a9gcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gtgggggagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttcccgg 


1320 


gtcctggccc 


atatggaggc 


caccggggta 


cggcgggacg 


tggcctacct 


tcaggccctt 


1380 


tccctggagc 


ttgcggagga 


gatccgccgc 


ctcgaggagg 


aggtcttccg 


cttggcgggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tgctctttga 


cgagcttagg 


1500 


cttcccgcct 


tggggaagac 


gcaaaagaca 


ggcaagcgct 


ccaccagcgc 


cgcggtgctg 


1560 


gaggccctac 


999aggccca 


ccccatcgtg 


gagaagatcc 


tccagcaccg 


ggagctcacc 


1620 


aavj w u w ctcig el 


aCaCC t.acgu 


^gACCCCCCC 


ccacigcccc^ 


uCwaCCC^cig 


ga cgggc cgc 




ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggga 


ggcttagtag 


ctccgacccc 


1740 


aacctgcaga 


acatccccgt 


ccgcaccccc 


ttgggccaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 
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acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2040 

caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 

gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 

ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 

gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 87 
<211> 2517 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 87 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 

aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 360 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 

ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc ectcctcaag 660 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 720 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 
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aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


gcggggctaa 


aggacctcaa 


ggaggtccgg 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


gcctcgaggg 


aggggctaga 


cctcgtgccc 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


gaggacgccg 


cccaccgggc 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgccgggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


ggagaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1660 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggacGc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 








a. a ^ cL u a w Vi> w a 




cgccgccgdc 


no OA 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tccacaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 
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<210> 88 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



<400> 88 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ^gtggacggc 
caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 
aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 
gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 
gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 
ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 
aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 
aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 
• aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 
agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 
cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 
gatcttctgg ccctggccgc cgccaggggc ggccgcgtgc accgggcagc agaccccttg 1020 
gcggggctaa aggacctcaa ggaggtccgg ggcctcctcg ccaaggacct cgccgtcttg 1080 
gcctcgaggg aggggctaga cctcgtgccc ggggacgacc ccatgctcct cgcctacctc 
ctggaccctt cgaacaccac ccccgagggg gtggcgcggc gctacggggg ggagtggacg 
gaggacgccg cccaccgggc cctcctctcg gagaggctcc atcggaacct ccttaagcgc 
ctcgaggggg aggagaagct cctttggctc taccacgagg tggaaaagcc cctctcccgg 1320 
gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct ccaggccttg 1380 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



1140 

1200 
1260 
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tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


ggagaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


gQtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


cati.ggcctt>c 


aacatgcccg 


tccagggcac 


cgc eg ccg ac 


O O O A 

22o0 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tccacaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 89 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 89 



atgaattcgg ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 
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aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 360 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 

ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 720 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 

agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 

gatcttctgg ccctggccgc cgccaggggc ggccgcgtgc accgggcagc agaccccttg 1020 

gcggggctaa aggacctcaa ggaggtccgg ggcctcctcg ccaaggacct cgccgtcttg 1080 

gcctcgaggg aggggctaga cctcgtgccc ggggacgacc ccatgctcct cgcctacctc 1140 

ctggaccctt cgaacacoac ccccgagggg gtggcgcggc gctacggggg ggagtggacg 1200 

gaggacgccg cccaccgggc cctcctctcg gagaggctcc atcggaacct ccttaagcgc 1260 

ctcgaggggg aggagaagct cctttggctc taccacgagg tggaaaagcc cctctcccgg 1320 

gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 1380 

tccctggagc ttgccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 1440 

caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctaggg 1500 

cttcccgcca tcggcaagac ggagaagacc ggcaagcgct ccaccagcgc cgccgtcctg 1560 

gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 1620 

aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 1740 

aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 1800 

gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 

gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 1920 

acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 
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cgggcggcca agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg tccacaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 90 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 90 



atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 
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cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 

gatcttctgg ccctggccgc cgccaggggc ggccgcgtgc accgggcagc agaccccttg 1020 

gcggggctaa aggacctcaa ggaggtccgg ggcctcctcg ccaaggacct cgccgtcttg 1080 

gcctcgaggg aggggctaga cctcgtgccc ggggacgacc ccatgctcct cgcctacctc 1140 

ctggaccctt cgaacaccac ccccgagggg gtggcgcggc gctacggggg ggagtggacg 1200 

gaggacgccg cccaccgggc cctcctctcg gagaggctcc atcggaacct ccttaagcgc 1260 

ctcgaggggg aggagaagct cctttggctc taccacgagg tggaaaagcc cctctcccgg 1320 

gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 1380 

tccctggagg tggccgagga gatccgccgc ctcgaggccg aggtcttccg cctggccggc 1440 

caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctaggg 1500 

cttcccgcca tcggcaagac ggagaagacc ggcaagcgct ccaccagcgc cgccgtcctg 1560 

gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 1620 

aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 1740 

aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 1800 

gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 

gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 1920 

acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2040 

caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 

gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 

ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 

gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 
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<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 91 



atgaattcgg 


ggatgctgcc 


cctctttgag cccaagggcc gggtcctcct ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc ctgaagggcc tcaccaccag ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag agcctcctca aggccctcaa ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag gccccctcct tccgccacga ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg gaggactttc cccggcaact cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt tccgaccgca tccacgtcct ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag tccgacaacc ttcccggggt caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag gagtggggga gcctggaagc cctcctcaag 


' 660 


aacctggacc 


ggctgaagcc 


cgccatccgg gagaagatcc tggcccacat ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt agggcctttc tggagaggct tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa agccccaagg ccctggagga ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc ggccgcgtgc accgggcagc agaccccttg 


1020 


gcggggctaa 


aggacctcaa 


ggaggtccgg ggcctcctcg ccaaggacct cgccgtcttg 


1080 


gcctcgaggg 


aggggctaga 


cctcgtgccc ggggacgacc ccatgctcct cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg gtggcgcggc gctacggggg ggagtggacg 


1200 


gaggacgccg 


cccaccgggc 


cctcctctcg gagaggctcc atcggaacct ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc taccacgagg tggaaaagcc cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg cgcctggacg tggcctatct cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc ctcgaggagg aggtcttccg cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag ctggaaaggg tcctctttga cgagctaggg 


1500 
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cttcccgcca tcggcaagac ggagaagacc ggcaagcgct ccaccagcgc cgccgtcctg 1560 

gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 1620 

aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 1740 

aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 1800 

gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 

gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 1920 

acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2040 

caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 

gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 

ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 

gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 92 
<211> 2517 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 92 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 

aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 360 
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gacgtcctgg ccagcctggc 
accgccgaca aagaccttta 
gggtacctca tcaccccggc 
gccgactacc gggccctgac 
ggggagaaga cggcgaggaa 
aacctggacc ggctgaagcc 
aagctctcct gggacctggc 
aaaaggcggg agcccgaccg 
agcctcctcc acgagttcgg 
cccccgccgg aaggggcctt 
gatcttctgg ccctggccgc 
gcggggctaa aggacctcaa 
gcctcgaggg aggggctaga 
ctggaccctt cgaacaccac 
gaggacgccg cccaccgggc 
ctcgaggggg aggagaagct 
gtcctggccc atatggaggc 
tccctggagg tggccgagga 
caccccttca acctcaactc 
cttcccgcca tcggcaagac 
gaggccctcc gcgaggccca 
aagctgaaga gcacctacat 
ctccacaccc gcttcaacca 
aacctccaga acatccccgt 
gccgaggagg ggtggctatt 
gcccacctct ccggcgacga 
acggagaccg ccagctggat 
cgggcggcca agaccatcaa 
caggagctag ccatccctta 



PCTAJSOl/17086 



caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


340 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


cgccaggggc 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


ggaggtccgg 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


cctcgtgccc 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaagg 


1500 


ggagaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


OCT t craG G G fc cr 


aaGtatacrcc 
yc**i* v>a L>ciy v*v* 


acf a t acf aac t 

C% ^4 Cl^ A GH^H ^wp W 


wwyyy u>y v« ^y 


1860 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 



370 



wo 01/90337 



PCT/USOl/17086 



ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 

gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 

ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 

gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 93 
<211> 2517 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 93 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 

Gcggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 

aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 360 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 

ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 720 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 

agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 
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gatcttctgg 


ccctggccgc cgccaggggc 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


gcggggctaa 


aggacctcaa ggaggtccgg 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


gcctcgaggg 


aggggctaga cctcgtgccc 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


gaggacgccg 


cccaccgggc cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac gcaaaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


qtQCCTQcracrcr 


caaccoaoca cataaccttc 


a. a c at cr c c ccr 

vLCL v^c^ ^^Xrf v^^j 


W Vif Vir Q. ^ ^ a ViT 


CCSCCCSCCCIS.C 


2280 


ctcatgaagc 


tggctatggt gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tccacaacga gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 94 

<211> 2517 

<212> DNA 

<213> Artificial 
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<220> 

<223> Synthetic 
<400> 94 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 

aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 360 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 

ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 720 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 

agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 

gatcttctgg ccctggccgc cgccaggggc ggccgcgtgc accgggcagc agaccccttg 1020 

gcggggctaa aggacctcaa ggaggtccgg ggcctcctcg ccaaggacct cgccgtcttg 1080 

gcctcgaggg aggggctaga cctcgtgccc ggggacgacc ccatgctcct cgcctacctc 1140 

ctggaccctt cgaacaccac ccccgagggg gtggcgcggc gctacggggg ggagtggacg 1200 

gaggacgccg cccaccgggc cctcctctcg gagaggctcc atcggaacct ccttaagcgc 1260 

ctcgaggggg aggagaagct cctttggctc taccacgagg tggaaaagcc cctctcccgg 1320 

gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 1380 

tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 1440 

caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctaggg 1500 

cttcccgcca tcggcaagac ggagaagacc ggcaagcgct ccaccagcgc cgccgtcctg 1560 
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gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagcaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcqgqagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280' 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tccacaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 95 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 95 



atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


.ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 
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gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 
gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 
ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 
aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 720 
aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 
aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 
agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 
cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 

gatcttctgg ccctggccgc cgccaggggc ggccgcgtgc accgggcagc agaccccttg 1020 

gcggggctaa aggacctcaa ggaggtccgg ggcctcctcg ccaaggacct cgccgtcttg 1080 

gcctcgaggg aggggctaga cctcgtgccc ggggacgacc ccatgctcct cgcctacctc 1140 

ctggaccctt cgaacaccac ccccgagggg gtggcgcggc gctacggggg ggagtggacg 1200 

gaggacgccg cccaccgggc cctcctctcg gagaggctcc atcggaacct ccttaagcgc 1260 

ctcgaggggg aggagaagct cctttggctc taccacgagg tggaaaagcc cctctcccgg 1320 

gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 1380 

tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 1440 

caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctaggg 1500 

cttcccgcca tcggcaagac ggagaagacc ggcaagcgct ccaccagcgc cgccgtcctg 1560 

gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 1620 

aagctgaaga acacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 1740 

aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 1800 

gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 

gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 1920 

acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2 040 

caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 
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gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgcgggagg cggccgagcg 


catggccttc 


ctcatgaagc tggctatggt 


gaagctcttc 


ctccttcagg tccacaacga 


gctggtcctc 


gcccggctgg ccaaggaggt 


catggagggg 


gaggtgggga 


taggggagga 


ctggctctcc 


<21p> 96 
<2li> 2517 
<212> DNA 
<213> Artificial 




<220> 

<223> Synthetic 




<400> 96 
acgaau ucgg 


ggacgccgcc 


cctctttgag 


caccacctgg 


cctaccgcac 


cttccacgcc 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gggtacaagg 


cgggccgggc 


ccccacgccg 


aaggagctgg 


tggacctcct 


ggggctggcg 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


accgccgaca 


aagaccttta 


ccagctcctt 


gggtacctca 


tcaccccggc 


ctggctttgg 


gccgactacc 


gggccctgac 


cggggacgag 


ggggagaaga 


cggcgaggaa 


gcttctggag 


aacctggacc 


ggctgaagcc 


cgccatccgg 


aagctctcct 


gggacctggc 


caaggtgcgc 


aaaaggcggg 


agcccgaccg 




agcctcctcc 


acgagttcgg 


ccttctggaa 


cccccgccgg 


^aggggcctt 


cgtgggcttt 


gatcttctgg 


ccctggccgc 


cgccaggggc 


gcggggctaa 


aggacctcaa 


ggaggtccgg 



gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gccaaggagc 


accaccacca 


ccaccac 


2517 



cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


agcctcctca 


aggccctcaa 


ggaggacggg 


loO 


gccccctcct 


tccgccacga 


..ggcctacggg 


240 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 
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gcctcgaggg aggggctaga cctcgtgccc ggggacgacc ccatgctcct cgcctacctc 1140 

ctggaccctt cgaacaccac ccccgagggg gtggcgcggc gctacggggg ggagtggacg 1200 

gaggacgccg cccaccgggc cctcctctcg gagaggctcc atcggaacct ccttaagcgc 1260 

ctcgaggggg aggagaagct cctttggctc taccacgagg tggaaaagcc cctctcccgg 1320 

gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 1380 

tccc.tggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 1440 

caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctaggg 1500 

cttcccgcca tcggcaagac ggagaagacc ggcaagcgct ccaccagcgc cgccgtcctg 1560 

gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 1620 

aagctgaaga gcacctacgt ggaccccttg ccggacctca tccaccccag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 1740 

aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 1800 

gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 

gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 1920 

acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2040 

caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 

gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 

ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 

gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 



<210> 97 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
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<400> 97 
atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggacctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


gcggggctaa 


aggacctcaa 


ggaggtccgg 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


gcctcgaggg 


aggggctaga 


cctcgtgccc 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


gaggacgccg 


cccaccgggc 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


ggagaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccgagcctcg 


tccaccccag 


gacgggccgc 


1680 
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cf f r»a a f n a 
^ w u^ciAwwa 


aapcrcfccaca acGacocraca aactaaataci cticcQatccc 


1740 


aoC 6 u v« o 




ffcrfacGCCO f!ttoacrr*acra aaatCCCfCCQ OQCCttcatC 


1800 




ggcggccatt 


rtrtVexannn^r o aanf-a^aarr" aGrat"aoraQCt CaaQCrtlQClZQ 


1860 


9cccdcctct 


ccggcgacga 


gaaccugauc cgygucutcc "■99»yyyy^y yyc».^-.ci.uwv*c*w 


1920 




ccagctggat 


gcccggcgtc ccccgggagg ccytyyat-^^t- uuu.yau.yoyw 


1980 


cgggcggcca 


agaccatcaa 


cuccgggywC wtcuacggca ugucyyccLrd L>(,..ywwu^u^#w 


2040 


caggagctag 




cgayyayyc^ v^ayycwuwwci uuyeiy^yowa wwu^wc&yayw 


2100 


ttccccaagg 


tgcgggcctg 


gaccgagaag accccggagg ayyycagyd.y y^yyyyyua^ 


2160 


gtggagaccc 


ucuccggccg 


«-ir«rr^r«rrr»t-a r» rr^o/^r'arrar^r' t'aciacrafr'f Cl acitGia3.CraQC 
Coy wCy wUGii.* y L^y w>uciya.ww uoyayy^^v-^y yywy«wy«3w 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc aacatgcccg tccagggcac cgccgccgac 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc cccaggctgg aggaaatggg ggccaggatg 


2340 


ctccttcagg 


tccacaacga 


gctggtcctc gaggccccaa aagagagggc ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg gtgtatcccc tggccgtgcc cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc gccaaggagc accaccacca ccaccac 


2517 



<210> 98 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 98 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 

aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 3 60 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 
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gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gcttaagagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


gcaaaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 




fid fittest i~t^ a Si 


^uuv^yyyyL.^ 


^ ^ ^ a f^nnr^ 9 

^ ucuacyy Uel 


Cy u-CyyCCCa 


^% 4* ^ 4* 

ccgccucccc 




caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 



380 



wo 01/90337 



PCT/USOl/17086 



ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 

ctccttcagg tccacaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 

gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 



<210> 99 
<211> 2517 
<212> DNA 
<213> Artificial 




<220> 

<223> Synthetic 




atgaattcgg 


ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 


60 


caccacctgg 


cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 


120 


ccggtgcagg 


cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 


180 


gacgcggtga 


tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 


240 


gggtacaagg 


cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 


300 


aaggagctgg 


tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 


360 


gacgtcctgg 


ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 


420 


accgccgaca 


aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 


480 


gggtacctca 


tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 


540 


gccgactacc 


gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 


600 


ggggagaaga 


cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 


660 


aacctggacc 


ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 


720 


aagctctcct 


gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 


840 


agcctcctcc 


acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 


900 


cccccgccgg 


aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 


960 


gatcttctgg 


ccctggccgc cgccaggggc ggccgcgtcc accgggcccc cgagccttat 


1020 


aaagccctca 


gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 


1080 


gccctgaggg 


aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 


1140 
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ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gcttgggagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


gcaaaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 






VrfCH^yyvvV^iM- 




tr* f* acfcrcr c a. c 


COCCQCCOSlC 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tccacaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 100 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 100 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 
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caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 

aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 360 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 

ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 720 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 

agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 

gatcttctgg ccctggccgc cgccaggggc ggccgcgtcc accgggcccc cgagccttat 1020 

aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 1080 

gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 1140 

ctggaccctt cgaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 1200 

gaggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gtggaagagg 1260 

cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cctttccgct 1320 

gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 1380 

tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc • 1440 

caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctaggg 1500 

cttcccgcca tcggcaagac gcaaaagacc ggcaagcgct ccaccagcgc cgccgtcctg 1560 

gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 1620 

aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 1740 
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aacctccaga 


acatccccgt 


ccgcaccccg 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gcccacctct 


ccggcgacga 


gaacctgatc 


acggagaccg 


ccagctggat 


gttcggcgtc 


cgggcggcca 


agaccatcaa 


cttcggggtc 


caggagctag 


ccatccctta 


cgaggaggcc 


ttccccaagg 


tgcgggcctg 


gattgagaag 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgcgggagg 


cggccgagcg 


catggccttc 


ctcatgaagc 


tggctatggt 


gaagctcttc 


ctccttcagg 


tccacaacga 


gctggtcctc 


gcccggctgg 


ccaaggaggt 


catggagggg 


gaggtgggga 


taggggagga 


ctggctctcc 


<210> 101 
<2X1> 2526 
<212> DNA 
<213> Artificial 




<220> 

<223> Synthetic 




<400> 101 
atgaattccg 


aggcgatgct 


tccgctcttt 


ggccaccacc 


tggcctaccg 


caccttcttc 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


tacgaggcct 


acaaggcggg 


gagggccccg 


ctcatcaagg 


agctggtgga 


cctcctgggg 


gcggacgacg 


ttctcgccac 


cctggccaag 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


cccgagggcc 


acctcatcac 


cccggagtgg 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


ggcatcgggg 


agaagaccgc 


cctcaagctc 



cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1960 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gccaaggagc 


accaccacca 


ccaccac 


2517 



gaacccaaag 


gccgggtcct 


cctggtggac 


60 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


ctttgggaga 


agtacggcct 


caggccggag 


540 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 
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ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 720 

gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 780 

gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 840 

gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 900 

gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960 

atgtgggcgg agcttaaagc cctggccgcc tgcagggacg gccgggtgca ccgggcagca 1020 

gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080 

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140 

gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 1200 

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 1260 

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 1320 

ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 1380 

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500 

gagcttaggc ttcccgcctt ggggaagacg caaaagacag gcaagcgctc caccagcgcc 1560 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800" 

* gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 

gacatcgcca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 
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gcccgcatgc 


tcctccaggt 


ccacaacgag 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


2400 


gaggaggtgg cggctttggc 


caaggaggcc 


atggagaagg 


cctatcccct 


cgccgtgccc 


2460 


ctggaggtgg aggtggggat 


gggggaggac 


tggctttccg 


ccaagggtca 


ccaccaccac 


2520 


caccac 












2526 


<210> 102 
<211> 2526 
<212> DNA 
<213> Artificial 












<220> 

<223> Synthetic 












<400> 102 
atgaattccg 


aQQCQ a t QC t 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


QaaccQortQC 

Z3 ****** ^323 *• 3 


acrQCQQtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


crqqtacaaQcr 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcG 


240 


tacgaggcct 


acaaocicoaa 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ct cat c aaaa 




cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 






cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aQQQQCCttC 


qtaqqcttCQ 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgtgca 


ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 
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gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 1260 

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 1320 

ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 1380 

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500 

gagcttaggc ttcccgcctt ggggaagacg caaaagacag gcaagcgctc caccagcgcc 1560 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc i860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220 

gtgaagagcg tcagggaggc cgcggaggcc atggccttca acatgcccgt ccagggcacc 2280 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt ccacaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 2520 
caccac 



<210> 103 

<211> 2526 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 103 



2526 
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atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgtgca 


ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 
















gagcttaggc 


ttcccgcctt 


ggggaagacg 


caaaagacag 


gcaagcgctc 


caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag 


gcttagtagc 


1740 
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caccac 



<210> 104 

<211> 2526 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 

cgcgtcctcg cccatctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 2520 



2526 



<400> 104 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 
ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 
gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 
gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 
tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 
ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 
gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 
atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 
cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 540 



60 
120 
180 
240 
300 
360 
420 
480 
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cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgtgca 


ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


gagcttaggc 


ttcccgcctt 


gaagaagacg 


aagaagacag 


gcaagcgctc 


caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag 


gcttagtagc 


1740 


tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


tgggccagag 


gatccgccgg 


1800 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg 


gtggccctgg 


actatagcca 


gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca 


gggtcttcca 


ggaggggaag 


1920 


gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc 


cgtggacccc 


1980 


Cugaugcgcc 


gggcggc c a a 


gacggngaac 


uccggcgucc 


uCuacggcau 


^^^^^^^ a 4* 

guccgcccau 




aggctctccc 


aggagcttgc 


catcccctac 


gaggaggcgg 


tggcctttat 


agagcgctac 


2100 


ttccaaagct 


tccccaaggt 


gcgggcctgg 


atagaaaaga 


ccctggagga 


ggggaggaag 


2160 


cggggctacg 


tggaaaccct 


cttcggaaga 


aggcgctacg 


tgcccgacct 


caacgcccgg 


2220 


gtgaagagcg 


tcagggaggc 


cgcggagcgc 


atggccttca 


acatgcccgt 


ccagggcacc 


2280 
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2526 



gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 2520 
caccac 



<210> 105 

<211> 2517 

<212> DMA 

<213> Artificial 

<220> 

<223> Synthetic 



60 
120 
180 
240 
300 
360 
420 



<400> 105 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 
caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 
aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 
gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 
gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 
ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 
aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 720 
aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 
aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 
agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 
cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 
gatcttctgg ccctggccgc cgccaggggc ggccgcgtcc accgggcccc cgagccttat 1020 
aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 1080 



600 
660 



780 
840 

900 
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gccctgaggg 


aaggccttgg 


cctcccgccc ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc gagaggctct 


tcgccaacct 


gcttaagagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


gcaaaagacc ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 






catcfcrccttc aacatcrccca 


^ ^ a ^ d \i» 




2280 


ctcatgaagc 


tggctatggt 


gaagctcttc cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tcgccaacga 


gctggtcctc gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 106 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
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^400> 106 ^ . 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 

aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 

ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 

agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 

ccGccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 

gatcttctgg ccctggccgc cgccaggggc ggccgcgtcc accgggcccc cgagccttat 1020 

aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 1080 

gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 1140 

ctggaccctt cgaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 1200 

gaggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gcttaagagg 1260 

cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cctttccgct 1320 

gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 1380 

tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 1440 

caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctaggg 1500 

cttcccgcca tcggcaagac gcaaaagacc ggcaagcgct ccaccagcgc cgccgtcctg 1560 

gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 1620 

aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
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ctccacaccc 


gcttcaacca 


gacggccacg 


aacctccaga 


acatCGCcgt 


ccgcaccccg 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gcccacctct 


ccggcgacga 


gaacctgatc 


acggagaccg 


ccagctggat 


gttcggcgtc 


cgggcggcca 


agaccatcaa 


cttcggggtc 


caggagctag 


ccatccctta 


cgaggaggcc 


ttccccaagg 


tgcgggcctg 


gattgagaag 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgcgggagg 


cggccgagcg 


catggccttc 


ctcatgaagc 


tggctatggt 


gaagctcttc 


ctccttcagg 


tcgccaacga 


gctggtcctc 


gcccggctgg 


ccaaggaggt 


catqciaQQcrQ 


gaggtgggga 


taggggagga 


ctggctctcc 


<210> 107 
<211> 2517 
<212> DNA 
<213> Artificial 




<220> 

<223> Synthetic 




<400> 107 
atgaattcgg 


ggatgctgcc 


cctctttgag 


caccacctgg 


cctaccgcac 


cttccacgcc 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


999tacaagg 


cgggccgggc 


ccccacgccg 


aaggagctgg 


tggacctcct 


ggggctggcg 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


accgccgaca 


aagaccttta 


ccagctcctt 


gggtacctca 


tcaccccggc 


ctggctttgg 


gccgactacc 


gggccctgac 


cggggacgag 



PCTAJSOl/17086 



gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


cgggtcttcc 


aggaggggcg 


ggacatcgcc 


1920 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


aciwougccwg 


U w d^^^ w CL\« 


^ w« ^ x« A w 


2280 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gccaaggagc 


accaccacca 


ccaccac 


2517 



cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 
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660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 
aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 
aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 
aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 
agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 
cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 
gatcttctgg ccctggccgc cgccaggggc ggccgcgtcc accgggcccc cgagccttat 
aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 
gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 
ctggaccctt cgaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 
gaggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gcttaagagg 
cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cctttccgct 
gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 
tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 1440 
caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctaggg 1500 
cttcccgcca tcggcaagac gcaaaagacc ggcaagcgct ccaccagcgc cgccgtcctg 
gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 
aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 
ctccacaccc gcttcdacca gacggccacg gccacgggca ggctaagtag ctccgatccc 
aacctccaga acatccccgt cgccaccccg cttgggcaga ggatccgccg ggccttcatc 
gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 
gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 
acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 
cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 
caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 
ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 
gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 
gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 



1560 
1620 
1680 
1740 
1800 



1920 
1980 
2040 
2100 
2160 
2220 
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ctcatgaagc tggctatggt 


gaagctcttc 


ctccttcagg tcgccaacga 


gctggtcctc 


gcccggctgg ccaaggaggt 


catggagggg 


gaggtgggga 


taggggagga 


ctggctctcc 


<210> 108 
<211> 2517 
<212> DNA 
<213> Artificial 




<220> 

<223> Synthetic 




<400> 108 


yy ciL>y L>y wvi* 


cctctttgag 


r' a o a ri r» nri 
C cl v.> C cLV* w t y ^ 




cttccacgcc 




f*tmt if" ^ a r^nn 

cyy L.C(^av;yy 


cttcgccaag 


o a r* rr cm or a 
y cn^y uyy L.y ct 


ui«>y uyy u u 


tgacgccaag 


orml" ar*aacrrf 
yyy ua%^aciy y 


cgggccgggc 


ccccacgccg 


««yy«y^fc-yy 


wyy ci^i^o^^^ 


ggggctggcg 


^CLw*4 wV«w «»3y 




caagaaggcg 


accgccgaca 


aagaccttta 


ccagctcctt 


gggtacctca 


tcaccccggc 


ctggctttgg 


gccgactacc 


gggccctgac 


cggggacgag 




cggcgaggaa 


gcttctggag 


aacctggacc 


ggctgaagcc 


cgccatccgg 


aagctctcct 


gggacctggc 


caaggtgcgc 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agcctcctcc 


acgagttcgg 


ccttctggaa 


cccccgccgg 


aaggggcctt 


CQtcrcracttt 


gatcttctgg 


ccctggccgc 


cgccaggggc 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ctggaccctt 


cgaacaccac 


ccccgagggg 
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cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gccaaggagc 


accaccacca 


ccaccac 


2517 



cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


agcctcctca 


aggccctcaa 


ggaggacggg 


160 


gccccctcct 


tccgccacga' 


ggcctacggg 


240 


gaggactttc 


cccggcaact 


cgccctcatc 


300^ 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


era aaacrcraorcr 


actaccracrat 


ccgcatcctc 


420 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 
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gaggaggcgg gggagcgggc cgcoctttcc gagaggctct tcgccaacct gcttaagagg 1260 

cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cotttccgct 1320 

gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 1380 

tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 1440 

cacccottca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctaggg 1500 

cttcocgcca tcaagaagac gcaaaagacc ggcaagcgct ccaccagcgc cgccgtcctg 1560 

gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 1620 

aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gcoacgggca ggctaagtag ctccgatccc 1740 

aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 1800 

gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 

gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 1920 

acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccoa ccgcctctcc 2040 

caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta otttcagagc 2100 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 

gtggagaccc tcttcggocg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 

ctccttcagg tcgccaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 

gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 109 
<211> 2517 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 

<400> 109 ^ ^ .„ 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 



397 



wo 01/90337 



PCTAJSOl/17086 



caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga ggcctacgag 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct gcttaagagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga cgagctaggg 


1500 


^ V- 4— c*f^t^fir*r^^ 

c u ucwcgcca 


t cggcaagac 


gcaaaagacc 


ggcaagcgct 


ccaccagcgc cgccgtcctg 


1 c n 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg ggccttcatc 


1800 
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2517 



gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 

gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 1920 

acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2040 

caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 

gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 

ctccttcagg tcgccaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 
gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 

<210> 110 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 110 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 
caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 
gggtacaagg cgggccgggc cgagacggag gaggactttc cccggcaact cgccctcatc 300 
aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 360 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 
gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 
gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 
ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 



60 



420 
480 
540 



660 
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aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gcttaagagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


gcaaaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccG 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


99tggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


Gcagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcQqgggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tcgccaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 
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gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 
gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccae 



<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 111 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 

aaggagctgg tggacctcct ggggttcacg cgcctcgagg tcccgggcta cgaggcggac 360 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 

ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 720 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 

agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 

gatcttctgg ccctggccgc cgccaggggc ggccgcgtcc accgggcccc cgagccttat 1020 

aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 1080 

gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 1140 

ctggaccctt cgaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 1200 

gaggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gcttaagagg 1260 
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cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctaggg 


1500 


cttcccgcca 


tcggcaagac 


gcaaaagacc 


ggcaagcgct 


ccaccagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


qtaccrcrQaaa 


CQaccaaaca 


catcraccttc 

V v% V W w \^ 










ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tcgccaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 112 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 112 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 
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gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 

aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 360 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 

ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 720 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 

agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 

gatcttctgg ccctggccgc cgccaggggc ggccgcgtcc accgggcccc cgagccttat 1020 

aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 1080 

gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 1140 

ctggaccctt cgaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 1200 

gaggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gcttaagagg 1260 

cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cctttccgct 1320 

gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 1380 

tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 1440 

caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctcagg 1500 

cttcccaagt tgaagaagac gaagaagacc ggtaagcgct ccaccagcgc cgccgtcctg 1560 

gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 1620 

aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 1740 

aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 1800 

gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 
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gcccacctct ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg tcgccaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 

1 


gcccggctgg ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 113 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 113 



atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 
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840 
900 



aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 
agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 

gatcttctgg ccctggccgc cgccaggggc ggccgcgtcc accgggcccc cgagccttat 1020 

aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 1080 

gccctgaggg aaggccttgg cctcccgccc ggcgacgacc <;catgctcct cgcctacctc 1140 

ctggaccctt cgaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 1200 

gaggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gcttaagagg 1260 

cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cctttccgct 1320 

gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 1380 

tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 1440 

caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctaagg 1500 

cttcccaaga tcaacaagac gaagaagacc ggtaagcgct ccaccagcgc cgccgtcctg 1560 

gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 1620 

aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 1740 

aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 1800 

gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 

gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 1920 

acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2040 

caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 

gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 

ctccttcagg tcgccaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 
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gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 114 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 114 



atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gcttaagagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 
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tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 1440 

caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctaagg 1500 

attcccaaga tcaagaagac gcataagacc ggtaagcgct ccaccagcgc cgccgtcctg 1560 

gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 1620 

aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 1740 

aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 1800 

gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 

gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 1920 

acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2040 

caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 

gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 

ctccttcagg tcgccaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 

gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 115 
<211> 2517 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 115 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 
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gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gcttaagagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctcagg 


1500 


cttcccaagt 


tgaagaagac 


gaagaagacc 


ggtaagcgct 


ccagcagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


cracQQC cacQ 


QCcacQQQca 


QQctaaQtacr 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


99tggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 
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<400> 116 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 



cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2040 

caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 

gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 

ctccttcagg tcgccaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 

gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 116 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



60 
120 
180 



caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 

aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 360 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 

ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 720 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 



409 



wo 01/90337 



PCTAJSOl/17086 



agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gcttaagagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctcagg 


1500 


cttcccaagt 


tgaagaagac 


gaagaagacc 


ggtaagcgct 


ccaccagcgc 


cgccctcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg 


cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tcgccaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 
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<210> 117 
<211> 2526 
<212> DNA 
<213> Artificial 

<:220> 

<223> Synthetic 
<400> 117 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 

ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120 

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 

gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240 

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 300 

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360 

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 420 

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 480 

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 540 

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccg aggggtcagg 600 

ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660 

ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 720 

gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 780 

gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 840 

gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 900 

gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960 

atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtgca ccgggcagca 1020 

gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080 

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140 

gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 1200 

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 1260 

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 1320 

ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 1380 

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440 
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ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


gagcttaggc 


ttcccgcctt 


ggggaagacg 


caaaagacag 


gcaagcgctc 


caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag 


gcttagtagc 


1740 


tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


tgggccagag 


gatccgccgg 


1800 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg 


gtggccctgg 


actatagcca 


gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca 


gggtcttcca 


ggaggggaag 


1920 


gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc 


cgtggacccc 


1980 


ctgatgcgcc 


gggcggccaa 


gacggtgaac 


ttcggcgtcc 


tctacggcat 


gtccgcccat 


2040 


aggctctccc 


aggagcttgc 


catcccctac 


gaggaggcgg 


tggcctttat 


agagcgctac 


2100 


ttccaaagct 


tccccaaggt 


gcgggcctgg 


atagaaaaga 


ccctggagga 


ggggaggaag 


2160 


cggggctacg 


tggaaaccct 


cttcggaaga 


aggcgctacg 


tgcccgacct 


caacgcccgg 


2220 


gtgaagagcg 


tcagggaggc 


cgcggagcgc 


atggccttca 


acatgcccgt 


ccagggcacc 


2280 


gccgccgacc 


tcatgaagct 


cgccatggtg 


aagctcttcc 


cccgcctccg 


ggagatgggg 


2340 


gcccgcatgc 


tcctccaggt 


cgccaacgag 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


2400 


gaggaggtgg 


cggctttggc 


caaggaggcc 


atggagaagg 


cctatcccct 


cgccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


gggggaggac 


tggctttccg 


ccaagggtca 


ccaccaccac 


2520 


caccac 












2526 


<210> 118 
<211> 2526 
<212> DNA 
<213> Artificial 












<220> 

<223> Synthetic 












<400> 118 

atgaattccg aggcgatgct 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


tacgaggcct acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 
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360 
420 



ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 
gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 480 

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 540 
cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 600 
ggcatcgggg agtataccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660 
ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 720 
gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 780 
gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 840 
gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 900 
gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960 

atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtgca ccgggcagca 1020 

gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080 

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140 

gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 1200 

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 1260 

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 1320 

ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 1380 

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500 

gagcttaggc ttcccgcctt ggggaagacg caaaagacag gcaagcgctc caccagcgcc 1560 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 
cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 
gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 
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ctgatgcgcc 


gggcggccaa 


gacggtgaac 


aggctctccc 


aggagcttgc 


catcccctac 


ttccaaagct 


tccccaaggt 


gcgggcctgg 


cggggctacg 


tggaaaccct 


cttcggaaga 


gtgaagagcg 


tcagggaggc 


cgcggagcgc 


gccgccgacc 


tcatgaagct 


cgccatggtg 


gcccgcatgc 


tcctccaggt 


cgccaacgag 


gaggaggtgg 


cggctttggc 


caaggaggcc 


ctggaggtgg 


aggtggggat 


gggggaggac 


caccac 






<210> 119 
<211> 2526 
<212> DNA 
<213> Artificial 




<220> 

<223> Synthetic 




<400> 119 
atgaattccg 


aggcgatgct 


tccgctcttt 


ggccaccacc 


tggcctaccg 


caccttcttc 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


tacgaggcct 


acaaggcggg 


gagggccccg 


ctcatcaagg 


agctggtgga 


cctcctgggg 


gcggacgacg 


ttctcgccac 


cctggccaag 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


cccgagggcc 


acctcatcac 


cccggagtgg 


cagtgggtgg 


acttccgcgc 




ggcatcaggg 


agaagaccgc 


cctcaagctc 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


gaagacctca 


ggctctcctt 


ggagctctcc 


gacctcgccc 


aggggcggga 


gcccgaccgg 



ttcggcgtcc 


tctacggcat 


gtccgcccat 


2040 


gaggaggcgg 


tggcctttat 


agagcgctac 


2100 


atagaaaaga 


ccctggagga 


ggggaggaag 


2160 


aggcgctacg 


tgcccgacct 


caacgcccgg 


2220 


atggccttca 


acatgcccgt 


ccagggcacc 


2280 


aagctcttcc 


cccgcctccg 


ggagatgggg 


2340 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


2400 


atggagaagg 


cctatcccct 


cgccgtgccc 


2460 


tggctttccg 


ccaagggtca 


ccaccaccac 


2520 



2526 



gaacccaaag 


gccgggtcct 


cctggtggac 


60 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


ctttgggaga 


agtacggcct 


caggccggag 


540 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


aacgtccggg 


agaagatcaa 


ggcccacctg 


* 720 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gaggggctta 


gggccttcct 


ggagaggctg 


840 
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900 
960 



gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 
gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 

atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtgca ccgggcagca 1020 

gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080 

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 114 0 

gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 1200 

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 1260 

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 1320 

ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 1380 

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac ISOO 

gagcttaggc ttcccgcctt ggggaagacg caaaagacag gcaagcgctc caccagcgcc 1560 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040 
aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 
ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 
cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220 
gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280 
gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 
gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 
gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 
ctggaggtgg aggtggggat ^gggggaggac tggctttccg ccaagggtca ccaccaccac 2520 
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caccac 






2526 


<210> 120 
<211> 2517 
<212> DNA 
<213> Artificial 






<220> 

<223> Synthetic 






<400> 120 
at^aattc99 


ggatgctgcc 


cctctttgag cccaagggcc gggtcctcct ggtggacggc 


60 


caccacct99 


cctaccgcac 


cttccacgcc ctgaagggcc tcaccaccag ccggggggag 


120 




cggtctacgg 


cttcgccaag agcctcctca aggccctcaa ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag gccccctcct tccgccacga ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg gaggactttc cccggcaact cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggttcacg cgcctcgagg tcccgggcta cgaggcggac . 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt tccgaccgca tccacgtcct ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag tccgacaacc ttcccggggt caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag gagtggggga gcctggaagc cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg gagaagatcc tggcccacat ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt agggcctttc tggagaggct tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa agccccaagg ccctggagga ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc ggccgcgtcc accgggcccc cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg gtggcccggc gctacggcgg ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc gagaggctct tcgccaacct gcttaagagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt taccgggagg tggagaggcc cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg cgcctggacg tggcctatct cagggccttg 


1380 
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tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 1440 

caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctcagg 1500 

cttcccaagt tgaagaagac gaagaagacc ggtaagcgct ccagcagcgc cgccgtcctg 1S60 

gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 1620 

aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 1740 

aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 1800 

gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 

gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 1920 

acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2040 

caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 

gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 

ctccttcagg tcgccaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 

gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 121 
<211> 2517 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 121 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 
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gggtacaagg 


cgggccgggc 


cgagacggag 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gcttaagagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctcagg 


1500 


cttcccaagt 


tgaagaagac 


gaagaagacc 


ggtaagcgct 


ccagcagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 
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2040 
2100 



cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 
caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2X60 

gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 

ctccttcagg tcgccaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 

gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 122 
<211> 2517 
<212> DNA 
<213> Axtificial 

<220> 

<223> Synthetic 
<400> 122 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 

aaggagctgg tggacctcct ggggttcacg cgcctcgagg tcccgggcta cgaggcggac 360 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caatggcatc 600 

ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 720 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 
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agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gcttaagagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctcagg 


1500 


cttcccaagt 


tgaagaagac 


gaagaagacc 


ggtaagcgct 


ccagcagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


QtQCaqqsiqQ 


ccrcrccaacrcq 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg 


tcgccaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 
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<210> 123 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 123 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 
caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 
aaggagctgg tggacctcct ggggttcacg cgcctcgagg tcccgggcta cgaggcggac 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 
gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 
gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 
ggggagaaga cgcagaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 
aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 
aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 
aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 
agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 
cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 
gatcttctgg ccctggccgc cgccaggggc ggccgcgtcc accgggcccc cgagccttat 1020 
aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 1080 
gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 1140 
ctggaccctt cgaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 1200 
gaggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gcttaagagg 
cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cctttccgct 
gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 1380 
tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 1440 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



840 

900 



1260 

1320 
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caccccttca acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctcagg 


1500 


cttcccaagt tgaagaagac 


gaagaagacc 


ggtaagcgct 


ccagcagcgc 


cgccgtcctg 


1560 


gaggccctcc gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct ccggcgacga 


gaacctgatc 


cgggtcttccf 


aggaggggcg 


ggacatccac 


1920 


acggagaccg ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


ccgcctctcc 


2040 


caggagctag ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtggagaccc tcttcggccg 


ccgccgctac 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


gtgcgggagg cggccgagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc tggctatggt 


gaagctcttc 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


ctccttcagg tcgccaacga 


gctggtcctc 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gcccggctgg ccaaggaggt 


catggagggg 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gaggtgggga taggggagga 


ctggctctcc 


gccaaggagc 


accaccacca 


ccaccac 


2517 



<210> 124 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 124 



atgaattcgg ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg tggacctcct 


ggggttcacg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 
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540 
600 
660 
720 
780 
840 
900 
9^0 



gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 . 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 
gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 
gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatg 
ggggagaaga cggggaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 
aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 
aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 
aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 
agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 
cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 

gatcttctgg ccctggccgc cgccaggggc ggccgcgtcc accgggcccc cgagccttat 102 0 

aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 1080 

gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 1140 

ctggaccctt cgaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 1200 

gaggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gcttaagagg 1260 

cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cctttccgct 1320 

gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 1380 

tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 1440 

caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctcagg 1500 

cttcccaagt tgaagaagac gaagaagacc ggtaagcgct ccagcagcgc cgccgtcctg 1560 

gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 1620 

aagctgaaga gcacctacat tgaccccttg ccggacctca tc.caccccag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 1740 
aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 1800 
gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 
gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 1920 
acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 
cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2040 
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caggagctag ccatccctta 


cgaggaggcc 


ttccccaagg tgcgggcctg 


gattgagaag 


gtggagaccc tcttcggccg 


ccgccgctac 


gtgcgggagg 


cggccgagcg 


catggccttc 


ctcatgaagc tggctatggt 


gaagctcttc 


ctccttcagg tcgccaacga 


gctggtcctc 


gcccggctgg ccaaggaggt 


catggagggg 


gaggtgggga taggggagga 


ctggctctcc 


<210> 125 
<211> 2517 
<212> DNA 
<213> Artificial 




<220> 

<223> Synthetic 




<400> 125 
atgaattcgg 


ggatgctgcc 


cctctttgag 


caccacctgg 


cctaccgcac 


cttccacgcc 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gggtacaagg 


cgggccgggc 


ccccacgccg 


aaggagctgg 


tggacctcct 


ggggttcacg 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


accgccgaca 


aagaccttta 


ccagctcctt 


gggtacctca 


tcaccccggc 


ctggctttgg 


gccgactacc 


gggccctgac 


cggsgacgag 


ggggagaata 


cggcgaggaa 


gcttctggag 


aacctggacc 


ggctgaagcc 




aagctctcct 


gggacctggc 


caaggtgcgc 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agcctcctcc 


acgagttcgg 


ccttctggaa 


cccccgccgg 


aaggggcctt 


cgtgggcttt 
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caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


accctggagg 


agggcaggag 


gcgggggtac 


2160 


gtgccagacc 


tagaggcccg 


ggtgaagagc 


2220 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


cccaggctgg 


aggaaatggg 


ggccaggatg 


2340 


gaggccccaa 


aagagagggc 


ggaggccgtg 


2400 


gtgtatcccc 


tggccgtgcc 


cctggaggtg 


2460 


gccaaggagc 


accaccacca 


ccaccac 


2517 



cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 
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gatcttctgg ccctggccgc cgccaggggc ggccgcgtcc accgggcccc cgagccttat 1020 

aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 1080 

gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 1140 

ctggaccctt cgaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 1200 

gaggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gcttaagagg 1260 

cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cctttccgct 1320 

gtcctggccc atatggaggc cacgggggtg cgcctggacg tggcctatct cagggccttg 1380 

tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 1440 

caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctcagg 1500 

cttcccaagt tgaagaagac gaagaagacc ggtaagcgct ccagcagcgc cgccgtcctg 1560 

gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 1620 

aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 1740 

aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 1800 

gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 

gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 1920 

acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2040 

caggagctag ccatccctta cgaggaggcc. caggccttca ttgagcgcta ctttcagagc 2100 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 

gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 

ctccttcagg tcgccaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 

gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 



<210> 126 
<211> 2517 
<212> DNA 
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<213> Artificial 
<220> 

<223> Synthetic 

<400> 126 



atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggttcacg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaagc 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 




a.a.ggccfctgg 


WW t.wwwgu>^W 


ggcgacga c c 


^waugwCWwU 


wgowL.GlwwUw 




ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gcttaagagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctcagg 


1500 


cttcccaagt 


tgaagaagac 


gaagaagacc 


ggtaagcgct 


ccagcagcgc 


cgccgtcctg 


1560 
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gaggccctcc gcgaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 1620 

aagctgaaga gcacctacat tgaccccttg ccggacctca tccaccccag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggca ggctaagtag ctccgatccc 1740 

aacctccaga acatccccgt ccgcaccccg cttgggcaga ggatccgccg ggccttcatc 1800 

gccgaggagg ggtggctatt ggtggccctg gactatagcc agatagagct cagggtgctg 1860 

gcccacctct ccggcgacga gaacctgatc cgggtcttcc aggaggggcg ggacatccac 1920 

acggagaccg ccagctggat gttcggcgtc ccccgggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agaccatcaa cttcggggtc ctctacggca tgtcggccca ccgcctctcc 2040 

caggagctag ccatccctta cgaggaggcc caggccttca ttgagcgcta ctttcagagc 2100 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 

gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 

gtgcgggagg cggccgagcg catggccttc ■ aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 

ctccttcagg tcgccaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 

gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 2517 

<210> 127 
<211> 2517 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 127 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 

aaggagctgg tggacctcct ggggttcacg cgcctcgagg tcccgggcta cgaggcggac 360 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 
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accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcccc 


cgagccttat 


1020 


aaagccctca 


gggacctgaa 


ggaggcgcgg 


gggcttctcg 


ccaaagacct 


gagcgttctg 


1080 


gccctgaggg 


aaggccttgg 


cctcccgccc 


ggcgacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcccggc 


gctacggcgg 


ggagtggacg 


1200 


gaggaggcgg 


gggagcgggc 


cgccctttcc 


gagaggctct 


tcgccaacct 


gcttaagagg 


1260 


cttgaggggg 


aggagaggct 


cctttggctt 


taccgggagg 


tggagaggcc 


cctttccgct 


1320 


gtcctggccc 


atatggaggc 


cacgggggtg 


cgcctggacg 


tggcctatct 


cagggccttg 


1380 


tccctggagg 


tggccgagga 


gatcgcccgc 


ctcgaggccg 


aggtcttccg 


cctggccggc " 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tcctctttga 


cgagctcagg 


1500 


cttcccaagt 


tgaagaagac 


gaagaagacc 


ggtaagcgct 


ccagcagcgc 


cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc 


tgcagtaccg 


ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca 


tccaccccag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca 


ggctaagtag 


ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga 


ggatccgccg 


ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc 


agatagagct 


cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc 


aggaggggcg 


ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca 


tgtcggccca 


cgccctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca 


ttgagcgcta 


ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg 


agggcaggag 


gcgggggtac 


2160 
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gtggagaccc tcttcggccg ccgccgctac gtgccagacc tagaggcccg ggtgaagagc 2220 

gtgcgggagg cggccgagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tggctatggt gaagctcttc cccaggctgg aggaaatggg ggccaggatg 2340 

ctccttcagg tcgccaacga gctggtcctc gaggccccaa aagagagggc ggaggccgtg 2400 

gcccggctgg ccaaggaggt catggagggg gtgtatcccc tggccgtgcc cctggaggtg 2460 

gaggtgggga taggggagga ctggctctcc gccaaggagc accaccacca ccaccac 25X7 

<210> 128 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 128 ^. 
atgaattcgg ggatgctgcc octet ttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 

aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 360 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 

ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 

agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 

gatcttctgg ccctggccgc cgccaggggc ggccgcgtgc accgggcagc agaccccttg 1020 



480 
540 



660 
720 
780 
840 

900 

960 
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gcggggctaa 


aggacctcaa 


ggaggtccgg ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


gcctcgaggg 


aggggctaga 


cctcgtgccc ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


gaggacgccg 


cccaccgggc 


cctcctctcg gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


caccggggta cggcgggacg 


tggcctacct 


tcaggccctt 


1380 


tccctggagc 


ttgcggagga 


gatccgccgc ctcgaggagg 


aggtcttccg 


cttggcgggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag ctggaaaggg 


tgctctttga 


cgagcttagg 


1500 


cttcccgcct 


tgaagaagac 


gaagaagaca ggcaagcgct 


ccaccagcgc 


cgcggtgctg 


1560 


gaggccctac 


gggaggccca 


ccccatcgtg gagaagatcc 


tccagcaccg 


ggagctcacc 


1620 


aagctcaaga 


acacctacgt 


ggaccccctc ccaagcctcg 


tccacccgag 


gacgggccgc 


1660 


ctccacaccc 


gcttcaacca 


gacggccacg gccacgggga 


ggcttagtag 


ctccgacccc 


1740 


aacctgcaga 


acatccccgt 


ccgcaccccc ttgggccaga 


ggatccgccg 


ggccttcgtg 


1800 


gccgaggcgg 


gttgggcgtt 


ggtggccctg gactatagcc 


agatagagct 


ccgcgtcctc 


1860 


gcccacctct 


ccggggacga 


aaacctgatc agggtcttcc 


aggaggggaa 


ggacatccac 


1920 


acccagaccg 


caagctggat 


gttcggcgtc cccccggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agacggtgaa 


cttcggcgtc ctctacggca 


tgtccgccca 


taggctctcc 


2040 


caggagcttg 


ccatccccta 


cgaggaggcg gtggccttta 


tagagcgcta 


cttccaaagc 


2100 


ttccccaagg 


tgcgggcctg 


gatagaaaag accctggagg 


aggggaggaa 


gcggggctac 


2160 


gtggaaaccc 


tcttcggaag 


aaggcgctac gtgcccgacc 


tcaacgcccg 


ggtgaagagc 


2220 


QtcaQQQaQQ 


ccQCQCiaoccr 


catioacctkG s^s.ca.\icsccca 

>-* w *3 ^ ^ w CtCi w Vj WWW J 




w y w v**5 \* w y ci w 


2280 


ctcatgaagc 


tcgccatggt 


gaagctcttc ccccgcctcc 


gggagatggg 


ggcccgcatg 


2340 


ctcctccagg 


tcgccaacga 


gctcctcctg gaggcccccc 


aagcgcgggc 


cgaggaggtg 


2400 


gcggctttgg 


ccaaggaggc 


catggagaag gcctatcccc 


tcgccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


tgggggagga 


ctggctttcc gccaagggtc 


accaccacca 


ccaccac 


2517 



<210> 129 

<211> 2526 

<212> DNA 

<213> Artificial 

<220> 
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<223> Synthetic 

<400> 129 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 

ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120 

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 

gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240 

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 300 

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360 

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 420 

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 480 

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 540 

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 600 

ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660 

ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 720 

gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 780 

gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 840 

gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 900 

gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960 

atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtcca ccgggccccc 1020 

gagccttata aagccctcag ggacctgaag gaggcgcggg ggcttctcgc caaagacctg 1080 

agcgttctgg ccctgaggga aggccttggc ctcccgcccg gcgacgaccc catgctcctc 1140 

gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcccggcg ctacggcggg 1200 

gagtggacgg aggaggcggg ggagcgggcc gccctttccg agaggctctt cgccaacctg 1260 

cttaagaggc ttgaggggga ggagaggctc ctttggcttt accgggaggt ggagaggccc 1320 

ctttccgctg tcctggccca tatggaggcc acgggggtgc gcctggacgt ggcctatctc 1380 

agggccttgt ccctggaggt ggccgaggag atcgcccgcc tcgaggccga ggtcttccgc 1440 

ctggccggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt cctctttgac 1500 

gagctagggc ttcccgccat caagaagacg caaaagaccg gcaagcgctc caccagcgcc 1560 

gccgtcctgg aggccctccg cgaggcccac cccatcgtgg agaagatcct gcagtaccgg 1620 
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gagctcacca 


agctgaagag 


cacctacatt 


gaccccttgc 


cggacctcat. 


ccaccccagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacgggcag 


gctaagtagc 


1740 


tccgatccca 


acctccagaa 


catccccgtc 


cgcaccccgc 


ttgggcagag 


gatccgccgg 


1800 


gccttcatcg 


ccgaggaggg 


gtggctattg 


gtggccctgg 


actatagcca 


gatagagctc 


1860 


agggtgctgg 


cccacctctc 


cggcgacgag 


aacctgatcc 


gggtcttcca 


ggaggggcgg 


1920 


gacatccaca 


cggagaccgc 


cagctggatg 


ttcggcgtcc 


cccgggaggc 


cgtggacccc 


1980 


ctgatgcgcc 


gggcggccaa 


gaccatcaac 


ttcggggtcc 


tctacggcat 


gtcggcccac 


2040 


cgcctctccc 


aggagctagc 


catcccttac 


gaggaggccc 


aggccttcat 


tgagcgctac 


2100 


tttcagagct 


tccccaaggt 


gcgggcctgg 


attgagaaga 


ccctggagga 


gggcaggagg 


2160 


cgggggtacg 


tggagaccct 


cttcggccgc 


cgccgctacg 


tgccagacct 


agaggcccgg 


2220 


gtgaagagcg 


tgcgggaggc 


ggccgagcgc 


atggccttca 


acatgcccgt 


ccagggcacc 


2280 


gccgccgacc 


tcatgaagct 


ggctatggtg 


aagctcttcc 


ccaggctgga 


ggaaatgggg 


2340 


gccaggatgc 


tccttcaggt 


cgccaacgag 


ctggtcctcg 


aggccccaaa 


agagagggcg 


2400 


gaggccgtgg 


cccggctggc 


caaggaggtc 


atggaggggg 


tgtatcccct 


ggccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


aggggaggac 


tggctctccg 


ccaaggagca 


ccaccaccac 


2520 


caccac 












252$ 


<210> 130 
<211> 2508 
<212> DNA 
<213> Artificial 












<220> 

<223> Synthetic 












<400> 130 

atggaattca ccccactttt 


tgacctggag 


gaacccccca 


agcgggtgct 


tctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttctat 


gccctgagcc 


tcaccacctc 


ccggggggag 


120 


ccggtgcaga 


tggtctacgg 


cttcgcccgg 


agcctcctca 


aggccttgaa 


ggaggacgga 


180 


caggcggtgg 


tcgtggtctt 




^ C U ^ U L> U i« L- 






240 


gcctacaagg 


cgggccgggc 


ccccaccccg 


gaggacttcc 


cccgccagct 


cgccttggtc 


300 


aagcggctgg 


tggaccttct 


gggcctggtc 


cgcctcgagg 


ccccggggta 


cgaggcggac 


360 


gacgtcctgg gcaccctggc 


caagaaggcc 


gaaagggagg 


ggatggaggt 


gcgcatcctc 


420 


acgggagacc 


gggacttctt 


ccagctcctc 


tccgagaagg 


tctcggtcct 


cctgccggac 


480 
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540 
600 
660 
720 
780 
840 
900 
960 



1080 

1140 



gggaccctgg tcaccccaaa ggacgtccag gagaagtacg gggtgccccc ggagcgctgg 
gtggacttcc gcgccctcac gggggaccgc tcggacaaca tccccggggt ggcggggata 
ggggagaaga ccgcccttcg actcctcgca gagtggggga gcgtggaaaa cctcctgaag 
aacctggacc gggtaaagcc ggactcgctc cggcgcaaga tagaggcgca cctcgaggac 
ctccacctct ccttagacct ggcccgcatc cgcaccgacc tccccctgga ggtggacttt 
aaggccctgc gccgcaggac ccccgacctg gagggcctga gggccttttt ggaggagctg 
gagttcggaa gcctcctcca cgagttcggc ctcctgggag gggagaagcc ccgggaggag 
gccccctggc ccccgcccga aggggccttc gtgggcttcc tcctttcccg caaggagccc 
atgtgggcgg agcttctggc cctggcggcg gcctcggagg gccgggtcca ccgggcaaca 1020 
agcccggttg aggccctggc cgacctcaag gaggcccggg ggttcctggc caaggacctg 
gccgttttgg ccctgcggga gggggtggcc ctggacccca cggacgaccc cctcctggtg 
gcctacctcc tggacccggc caacacccac cccgaggggg tggcccggcg ctacgggggc 1200 
gagttcacgg aggacgcagc ggagagggcc ctcctctccg agaggctctt ccagaacctc 1260 
tttaaacggc tttccgagaa gctcctctgg ctctaccagg aggtggagcg gcccctcttc 1320 
cgggtcttgg cccacatgga ggcccggggg gtgaggctgg acgtccccct tctggaggcc 1380 
ctctcctttg agctggagaa ggagatggag cgcctggagg gggaggtctt ccgtttggcc 1440 
ggccacccct tcaacctcaa ctcccgcgac cagctggaaa gggtcctctt tgacgagctg 1500 
ggcctcaccc cggtgggccg gacgcagaag acgggcaagc gctccaccgc ccagggggcc 1560 
ctggaggccc tccggggggc ccaccccatc gtggagctca tcctccagta ccgggagctt 1620 
tccaagctca aaagcaccta cctggacccc ctgccccggc tcgtccaccc gcggacgggc 1680 
cggctccaca cccgcttcaa ccagacggcc acggccacgg gaaggctttc cagctccgac 1740 
cccaacctgc agaacatccc cgtgcgcacc cccttggggc agcgcatccg caaggccttc 1800 
gtggccgagg aggggtggct ccttttggcg gcggactact cccagattga gctccgggtc 1860 
ctggcccacc tctcggggga cgagaacctg aagcgggtct tccgggaggg gaaggacatc 1920 
cataccgaga ccgccgcctg gatgttcggc ttagaccccg ctctggtgga tccaaagatg 1980 
cgccgggcgg ccaagacggt caacttcggc gtcctctacg ggatgtccgc ccacaggctc 
tcccaggagc tcggcataga ctacaaggag gcggaggcct ttattgagcg ctacttccag 
agcttcccca aggtgcgggc ctggatagaa aggaccctgg aggagggccg gacgcggggc 2160 



2040 
2100 
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tacgtggaga ccctgttcgg 


caggaggcgc 


tatgtgcccg 


acctggcctc ccgggtccgc 


2220 


tcggtgcggg aggcggcgga 


gcggatggcc 


ttcaacatgc 


ccgtgcaggg caccgccgcc 


2280 


gacctgatga agatcgccat 


ggtcaagctc 


ttccccaggc 


taaagcccct gggggcccac 


2340 


ctcctcctcc aagtgcacaa 


cgagctggtc 


ctggaggtgc 


ccgaggaccg ggccgaggag 


2400 


gccaaggccc tggtcaagga. 


ggtcatggag 


aacgcctacc 


ccctggacgt gcccctcgag 


2460 


gtggaggtgg gcgtgggtcg 


ggactggctg 


gaggcgaagc 


aggattga 


2508 



<210> 131 

<211> 2499 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 131 



atggaattcc 


tgcccctctt 


tgagcccaag 


ggccgggtgc 


ttctggtgga cggccaccac 


60 


ctggcctacc 


gtaccttttt 


tgccctgaag 


ggcctcacca 


ccagccgcgg ggagccggtc 


120 


ca.Qacaat.Qt. 










180 


gtgatcgtgg 


tgtttgacgc 


caaggccccc 


tccttccgcc 


accagaccta cgaggcctac 


240 


aaggcggggc 


gggctcccac 


ccccgaggac 


tttccccggc 


agcttgccct tatcaaggag 


300 


atggtggacc 


ttttgggcct 


ggagcgcctc 


gaggtgccgg 


gctttgaagc ggatgacgtc 


360 


ctggctaccc 


tggccaagaa 


ggcggaaaag 


gaaggctacg 


aagtgcgcat cctcaccgcg 


420 


gaccgggacc 


tttaccagct 


tctttcggag 


cgaatctcca 


tccttcaccc ggagggttac 


480 


ctgatcaccc 


cggagtggct 


ttgggagaag 


tatgggctta 


agccttccca gtgggtggac 


540 


taccgggcct 


tggccgggga 


cccttccgac 


aacatccccg 


gcgtgaaggg catcggggag 


600 


aagacggcgg 


ccaagctgat 


ccgggagtgg 


ggaagcctgg 


aaaaccttct taagcacctg 


660 


gaacaggtga 


aacctgcctc 


cgtgcgggag 


aagatcctta 


gccacatgga ggacctcaag 


720 


ctatccctgg 


agctatcccg 


ggtgcacacg 


gacttgctcc 


ttcaggtgga cttcgcccgg 


780 


cgccgggagc 


cggaccggga 


ggggcttaag 


gcctttttgg 


agaggctgga gttcggaagc 


840 


ctcctccacg 


agttcggcct 


gttggaaagc 


ccggtggcgg 


cggaggaagc tccctggccg 


900 


ccccccgagg 


gagccttcgt 


ggggtacgtt 


ctttcccgcc 


ccgagcccat gtgggcggag 


960 


cttaacgcct 


tggccgccgc 


ctgggaggga 


agggtttacc 


gggcggagga tcccttggag 


1020 


gccttgcggg 


ggcttgggga 


ggtgaggggg 


cttttggcca 


aggacctggc ggtgctggcc 


1080 
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ctgagggaag ggattgccct ggcaccgggc gacgacccca tgctcctcgc ctacctcctg 1140 

gatccttcca acaccgcccc cgaaggggta gcccggcgct acggggggga gtggaccgag 1200 

gaggcggggg aaagggcgct gctttccgaa aggctttacg ccgccctcct gaagcggctt 1260 

aagggggagg agaggcttct ttggctttac gaggaggtgg aaaagcccct ttcgcgggtc 1320 

ctggcccaca tggaggccac gggggtacgg ttggatgtgg cctacttaaa ggccctttcc 1380 

ctggaggtgg aggcggagat aaggcgcttc gaggaggagg tccaccgcct ggccgggcat 1440 

cctttcaacc tgaactcccg ggaccagctg gaaagggtca tctttgacga gcttgggctt 1500 

cccgccatcg gcaagacgca gaagacgggc aagcgctcca ccagcgccgc cgttttggag 1560 

gccttgcggg aggctcatcc catcgtggac cgcatccttc agtaccggga gctttccaag 1620 

ctcaagggaa cctacatcga tcccttgcct gccctggtcc accccaagac gaaccgcctc 1680 

cacacccgtt tcaaccagac ggccaccgcc acggggaggc ttagcagctc ggatcctaat 1740 

ctgcaaaata tccccgtgcg cacccctttg ggccagcgga tccgccgggc cttcgtggcc 1800 

gaggaggggt ggaggctggt ggttttggac tacagccaga ttgagctcag ggtcctggcg 1860 

cacctttccg gggacgagaa cctaatccgg gtcttccagg agggccagga catccacacc 1920 
cagacggcca gctggatgtt cggcgtgccc ccagaggccg tggattccct gatgcgccgg 1980 
gcggccaaga ccatcaactt cggcgtcctc tacggcatgt ccgcccaccg gctttcggga 2040 
gagctggcca tcccctacga ggaggcggtg gccttcatcg agcggtattt ccagagctac 2100 
cccaaggtgc gggcctggat tgagaaaacc ctggcggaag gacgggaacg gggctatgtg 2160 
gaaaccctct ttggccgccg gcgctacgtg cccgacttgg cttcccgggt gaagagcatc 2220 
cgggaggcag cggagcgcat ggccttcaac atgccggtcc aggggaccgc cgcggatttg 2280 
atgaaactgg ccatggtgaa gctctttccc aggcttcagg agctgggggc caggatgctt 2340 
ttgcaggtgc acaacgaact ggtcctcgag gctcccaagg agcaagcgga ggaagtcgcc 
caggaggcca agcggaccat ggaggaggtg tggcccctga aggtgccctt ggaggtggaa 
gtgggcatcg gggaggactg gctttccgcc aaggcctag 



<210> 132 

<211> 2526 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



2400 
2460 
2499 
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<400> 132 
atgaattcca 


ccccactttt 


tgacctggag 


gaacccccca 


agcgggtgct 


tctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttctat 


gccctgagcc 


tcaccacctc 


ccggggggag 


120 


ccggtgcaga 


tggtctacgg 


cttcgcccgg 


agcctcctca 


aggccttgaa 


ggaggacgga 


180 


caggcggtgg 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacgag 


240 


gcctacaagg 


cgggccgggc 


ccccaccccg 


gaggacttcc 


cccgccagct 


cgccttggtc 


300 


aagcggctgg 


tggaccttct 


gggcctggtc 


cgcctcgagg 


ccccggggta 


cgaggcggac 


360 


gacgtcctgg 


gcaccctggc 


caagaaggcc 


gaaagggagg 


ggatggaggt 


gcgcatcctc 


420 


^cgggagacc 


gggacttctt 


ccagctcctc 


tccgagaagg 


tctcggtcct 


cctgccggac 


480 


gggaccctgg 


tcaccccaaa 


ggacgtccag 


gagaagtacg 


gggtgccccc 


ggagcgctgg 


540 


gtggacttcc 


gcgccctcac 


gggggaccgc 


tcggacaaca 


tccccggggt 


ggcggggata 


600 


ggggagaaga 


ccgcccttcg 


actcctcgca 


gagtggggga 


gcgtggaaaa 


cctcctgaag 


660 


aacctggacc 


gggtaaagcc 


ggactcgctc 


cggcgcaaga 


tagaggcgca 


cctcgaggac 


720 


ctccacctct 


ccttagacct 


ggcccgcatc 


cgcaccgacc 


tccccctgga 


ggtggacttt 


780 


aaggccctgc 


gccgcaggac 


ccccgacctg 


gagggcctga 


gggccttttt 


ggaggagctg 


840 


gagttcggaa 


gcctcctcca 


cgagttcggc 


ctcctgggag 


gggagaagcc 


ccgggaggag 


900 


gccccctggc 


ccccgcccga 


aggggccttc 


gtgggcttcc 


tcctttcccg 


caaggagccc 


960 


atgtgggcgg 


agcttctggc 


cctggcggcg 


gcctcgggcg 


gccgcgtgca 


ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


cigaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


gagcttaggc 


ttcccgcctt 


gaagaagacg 


aagaagacag 


gcaagcgctc 


caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 
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acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga' ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 2520 
caccac 



60 



2526 



<210> 133 

<211> 2514 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 133 

atgaattccc tgcccctctt tgagcccaag ggccgggtgc ttctggtgga cggccaccac 
ctggcctacc gtaccttttt tgccctgaag ggcct caeca ccagccgcgg ggagccggtc 120 
caggcggtgt acgggtttgc caagagcctt ttgaaggcgc taagggaaga cggggatgtg 180 
gtgatcgtgg tgtttgacgc caaggccccc tccttccgcc accagaccta cgaggcctac 240 
aaggcggggc gggctcccac ccccgaggac tttccccggc agcttgccct tatcaaggag 300 
atggtggacc ttttgggcct ggagcgcctc gaggtgccgg gctttgaagc ggatgacgtc 3 60 
ctggctaccc tggccaagaa ggcggaaaag gaaggctacg aagtgcgcat cctcaccgcg 420 
gaccgggacc tttaccagct tctttcggag cgaatctcca tccttcaccc ggagggttac 480 
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ctgatcaccc 


cggagtggct 


ttgggagaag 


tatgggctta 


agccttccca 


gtgggtggac 


540 


taccgggcct 


tggccgggga 


cccttccgac 


aacatccccg 


gcgtgaaggg 


catcggggag 


600 


aagacggcgg 


ccaagctgat 


ccgggagtgg 


ggaagcctgg 


aaaaccttct 


taagcacctg 


660 


gaacaggtga 


aacctgcctc 


cgtgcgggag 


aagatcctta 


gccacatgga 


ggacctcaag 


720 


ctatccctgg 


agctatcccg 


ggtgcacacg 


gacttgctcc 


ttcaggtgga 


cttcgcccgg 


780 


cgccgggagc 


cggaccggga 


ggggcttaag 


gcctttttgg 


agaggctgga 


gttcggaagc 


840 


ctcctccacg 


agttcggcct 


gttggaaagc 


ccggtggcgg 


cggaggaagc 


tccctggccg 


900 


ccccccgagg 


gagccttcgt 


ggggtacgtt 


ctttcccgcc 


ccgagcccat 


gtgggcggag 


960 


cttaacgcct 


tggccgccgc 


ctggggcggc 


cgcgtgcacc 


gggcagcaga 


ccccttggcg 


1020 


gggctaaagg 


acctcaagga 


ggtccggggc 


ctcctcgcca 


aggacctcgc 


cgtcttggcc 


1080 


tcgagggagg 


ggctagacct 


cgtgcccggg 


gacgacccca 


tgctcctcgc 


ctacctcctg 


1140 


gacccttcga 


acaccacccc 


cgagggggtg 


gcgcggcgct 


acggggggga 


gtggacggag 


1200 


gacgccgccc 


accgggccct 


cctctcggag 


aggctccatc 


ggaacctcct 


taagcgcctc 


1260 


gagggggagg 


agaagatcct 


ttggctctac 


cacgaggtgg 


aaaagcccct 


ctcccgggtc 


1320 


ctggcccata 


tggaggccac 


cggggtacgg 


cgggacgtgg 


cctaccttca 


ggccctttcc 


1360 


ctggagcttg 


cggaggagat 


ccgccgcctc 


gaggaggagg 


tcttccgctt 


ggcgggccac 


1440 


cccttcaacc 


tcaactcccg 


ggaccagctg 


gaaagggtgc 


tctttgacga 


gcttaggctt 


1500 


cccgccttga 


agaagacgaa 


gaagacaggc 


aagcgctcca 


ccagcgccgc 


ggtgctggag 


1560 


gccctacggg 


aggcccaccc. 


catcgtggag 


aagatcctcc 


agcaccggga 


gctcaccaag 


1620 


ctcaagaaca 


cctacgtgga 


ccccctccca 


agcctcgtcc 


acccgaggac 


gggccgcctc 


1680 


cacacccgct 


tcaaccagac 


ggccacggcc 


acggggaggc 


ttagtagctc 


cgaccccaac 


1740 


ctgcagaaca 


tccccgtccg 


cacccccttg 


ggccagagga 


tccgccgggc 


cttcgtggcc 


1800 


gaggcgggtt 


gggcgttggt 


ggccctggac 


tatagccaga 


tagagctccg 


cgtcctcgcc 


1860 


cacctctccg 


gggacgaaaa 


cctgatcagg 


gtcttccagg 


aggggaagga 


catccacacc 


1920 








^ wy y oy y wy 


4* cma r^fr* r»r»t* 


y w^*y wyw^yy 


1980 


gcggccaaga 


cggtgaactt 


cggcgtcctc 


tacggcatgt 


ccgcccatag 


gctctcccag 


2040 


gagcttgcca 


tcccctacga 


ggaggcggtg 


gcctttatag 


agcgctactt 


ccaaagcttc 


2100 


cccaaggtgc 


gggcctggat 


agaaaagacc 


ctggaggagg 


ggaggaagcg 


gggctacgtg 


2160 


gaaaccctct 


tcggaagaag 


gcgctacgtg 


cccgacctca 


acgcccgggt 


gaagagcgtc 


2220 
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2400 

2460 



agggaggccg cggagcgcat ggccttcaac atgcccgtcc agggcaccgc cgccgacctc 2280 
atgaagctcg ccatggtgaa gctcttcccc cgcctccggg agatgggggc ccgcatgctc 2340 
ctccaggtcg ccaacgagct cctcctggag gccccccaag cgcgggccga ggaggtggcg 
gctttggcca aggaggccat ggagaaggcc tatcccctcg ccgtgcccct ggaggtggag 
gtggggatgg gggaggactg gctttccgcc aagggtcacc accaccacca ccac 2514 

<210> 134 

<211> 2526 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 134 

atgaattcca ccccactttt tgacctggag gaacccccca agcgggtgct tctggtggac 
ggccaccacc tggcctaccg caccttctat gccctgagcc tcaccacctc ccggggggag 
ccggtgcaga tggtctacgg cttcgcccgg agcctcctca aggccttgaa ggaggacgga 
caggcggtgg tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacgag 
gcctacaagg cgggccgggc ccccaccccg gaggacttcc cccgccagct cgccttggtc 300 
aagcggctgg tggaccttct gggcctggtc cgcctcgagg ccccggggta cgaggcggac 
gacgtcctgg gcaccctggc caagaaggcc gaaagggagg ggatggaggt gcgcatcctc 
acgggagacc gggacttctt ccagctcctc tccgagaagg tctcggtcct cctgccggac 
gggaccctgg tcaccccaaa ggacgtccag gagaagtacg gggtgccccc ggagcgctgg 
gtggacttcc gcgccctcac gggggaccgc tcggacaaca tccccggggt ggcggggata 
ggggagaaga ccgcccttcg actcctcgca gagtggggga gcgtggaaaa cctcctgaag 
aacctggacc gggtaaagcc ggactcgctc cggcgcaaga tagaggcgca cctcgaggac 
ctccacctct ccttagacct ggcccgcatc cgcaccgacc tccccctgga ggtggacttt 
aaggccctgc gccgcaggac ccccgacctg gagggcctga gggccttttt ggaggagctg 840 
gagttcggaa gcctcctcca cgagttcggc ctcctgggag gggagaagcc ccgggaggag 900 
gccccctggc ccccgcccga aggggccttc gtgggcttcc tcctttcccg caaggagccc 960 
atgtgggcgg agcttctggc cctggcggcg gcctcgggcg gccgcgtcca ccgggccccc 1020 
gagccttata aagccctcag ggacctgaag gaggcgcggg ggcttctcgc caaagacctg 



60 
120 
180 
240 



360 
420 
480 
540 
600 
660 
720 
780 



1080 
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agcgttctgg 


ccctgaggga 


aggccttggc ctcccgcccg gcgacgaccc catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc cccgaggggg tggcccggcg ctacggcggg 


1200 


gagtggacgg 


aggaggcggg 


ggagcgggcc gccctttccg agaggctctt cgccaacctg 


1260 


cttaagaggc 


ttgaggggga 


ggagaggctc ctttggcttt accgggaggt ggagaggccc 


1320 


ctttccgctg 


tcctggccca 


tatggaggcc acgggggtgc gcctggacgt ggcctatctc 


1380 


agggccttgt 


ccctggaggt 


ggccgaggag atcgcccgcc tcgaggccga ggtcttccgc 


1440 


ctggccggcc 


accccttcaa 


cctcaactcc cgggaccagc tggaaagggt cctctttgac 


1500 


gagctagggc 


ttcccgccat 


caagaagacg caaaagaccg gcaagcgctc caccagcgcc 


1560 


gccgtcctgg 


aggccctccg 


cgaggcccac cccatcgtgg agaagatcct gcagtaccgg 


1620 


gagctcacca 


agctgaagag 


cacctacatt gaccccttgc cggacctcat ccaccccagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag acggccacgg ccacgggcag gctaagtagc 


1740 


tccgatccca 


acctccagaa 


catccccgtc cgcaccccgc ttgggcagag gatccgccgg 


1800 


gccttcatcg 


ccgaggaggg 


gtggctattg gtggccctgg actatagcca gatagagctc 


1860 


agggtgctgg 


cccacctctc 


cggcgacgag aacctgatcc gggtcttcca ggaggggcgg 


1920 


gacatccaca 


cggagaccgc 


cagctggatg ttcggcgtcc cccgggaggc cgtggacccc 


1980 


ctgatgcgcc 


gggcggccaa 


gaccatcaac ttcggggtcc tctacggcat gtcggcccac 


2040 


cgcctctccc 


aggagctagc 


catcccttac gaggaggccc aggccttcat tgagcgctac 


2100 


tttcagagct 


tccccaaggt 


gcgggcctgg attgagaaga ccctggagga gggcaggagg 


2160 


cgggggtacg 


tggagaccct 


cttcggccgc cgccgctacg tgccagacct agaggcccgg 


2220 


gtgaagagcg 


tgcgggaggc 


ggccgagcgc atggccttca acatgcccgt ccagggcacc 


2280 








2340 


gccaggatgc 


tccttcaggt 


cgccaacgag ctggtcctcg aggccccaaa agagagggcg 


2400 


gaggccgtgg 


cccggctggc 


caaggaggtc atggaggggg tgtatcccct ggccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


aggggaggac tggctctccg ccaaggagca ccaccaccac 


2520 


caccac 






2526 



<210> 135 

<211> 2514 

<212> DNA 

<213> Artificial 

<220> 
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60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 

720 



<223> Synthetic 
<400> 135 

atgaattccc tgcccctctt tgagcccaag ggccgggtgc ttctggtgga cggccaccac 
ctggcctacc gtaccttttt tgccctgaag ggcctcacca ccagccgcgg ggagccggtc 
caggcggtgt acgggtttgc caagagcctt ttgaaggcgc taagggaaga cggggatgtg 
gtgatcgtgg tgtttgacgc caaggccccc tccttccgcc accagaccta cgaggcctac 
aaggcggggc gggctcccac ccccgaggac tttccccggc agcttgccct tatcaaggag 
atggtggacc ttttgggcct ggagcgcctc gaggtgccgg gctttgaagc ggatgacgtc 
ctggctaccc tggccaagaa ggcggaaaag gaaggctacg aagtgcgcat cctcaccgcg 
gaccgggacc tttaccagct tctttcggag cgaatctcca tccttcaccc ggagggttac 
ctgatcaccc cggagtggct ttgggagaag tatgggctta agccttccca gtgggtggac 
taccgggcct tggccgggga cccttccgac aacatccccg gcgtgaaggg catcggggag 
aagacggcgg ccaagctgat ccgggagtgg ggaagcctgg aaaaccttct taagcacctg 
gaacaggtga aacctgcctc cgtgcgggag aagatcctta gccacatgga ggacctcaag 
ctatccctgg agctatcccg ggtgcacacg gacttgctcc ttcaggtgga cttcgcccgg 780 
cgccgggagc cggaccggga ggggcttaag gcctttttgg agaggctgga gttcggaagc 840 
ctcctccacg agttcggcct gttggaaagc ccggtggcgg cggaggaagc tccctggccg 900 
ccccccgagg gagccttcgt ggggtacgtt ctttcccgcc ccgagcccat gtgggcggag 960 
cttaacgcct tggccgccgc ctggggcggc cgcgtccacc gggcccccga gccttataaa 1020 
gccctcaggg acctgaagga ggcgcggggg cttctcgcca aagacctgag cgttctggcc 1080 
ctgagggaag gccttggcct cccgcccggc gacgacccca tgctcctcgc ctacctcctg 1140 
gacccttcga acaccacccc cgagggggtg gcccggcgpt acggcgggga gtggacggag 1200 
gaggcggggg agcgggccgc cctttccgag aggctcttcg ccaacctgct taagaggctt 1260 
gagggggagg agaggctcct ttggctttac cgggaggtgg agaggcccct ttccgctgtc 1320 
ctggcccata tggaggccac gggggtgcgc ctggacgtgg cctatctcag ggccttgtcc 1380 
ctggaggtgg ccgaggagat cgcccgcctc gaggccgagg tcttccgcct ggccggccac 1440 
cccttcaacc tcaactcccg ggaccagctg gaaagggtcc tctttgacga gctagggctt 1500 
cccgccatca agaagacgca aaagaccggc aagcgctcca ccagcgccgc cgtcctggag 1560 
gccctccgcg aggcccaccc catcgtggag aagatcctgc agtaccggga gctcaccaag 1620 
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ctgaagagca 


cctacattga 


ccccttgccg 


gacctcatcc 


accccaggac 


gggccgcctc 


1680 


cacacccgct 


tcaaccagac 


•ggccacggcc 


acgggcaggc 


taagtagctc 


cgatcccaac 


1740 


ctccagaaca 


tccccgtccg 


caccccgctt 


gggcagagga 


tccgccgggc 


cttcatcgcc 


1800 


gaggaggggt 


ggctattggt 


ggccctggac 


tatagccaga 


tagagctcag 


ggtgctggcc 


1860 


cacctctccg 


gcgacgagaa 


cctgatccgg 


gtcttccagg 


aggggcggga 


catccacacg 


1920 


gagaccgcca 


gctggatgtt 


cggcgtcccc 


cgggaggccg 


tggaccccct 


gatgcgccgg 


1980 


gcggccaaga 


ccatcaactt 


cggggtcctc 


tacggcatgt 


cggcccaccg 


cctctcccag 


2040 


gagctagcca 


tcccttacga 


ggaggcccag 


gccttcattg 


agcgctactt 


tcagagcttc 


2100 


cccaaggtgc 


gggcctggat 


tgagaagacc 


ctggaggagg 


gcaggaggcg 


ggggtacgtg 


2160 


gagaccctct 


tcggccgccg 


ccgctacgtg 


ccagacctag 


aggcccgggt 


gaagagcgtg 


2220 


caaaacraccra 


ccaaacacat 


oacctticaac 


ahcfccccf tec 




ccrccaacctc 


2280 


atgaagctgg 


ctatggtgaa 


gctcttcccc 


aggctggagg 


aaatgggggc 


caggatgctc 


2340 


cttcaggtcg 


ccaacgagct 


ggtcctcgag 


gccccaaaag 


agagggcgga 


ggccgtggcc 


2400 


cggctggcca 


aggaggtcat 


ggagggggtg 


tatcccctgg 


ccgtgcccct 


ggaggtggag 


2460 


gtggggatag 


gggaggactg 


gctctccgcc 


aaggagcacc 


accaccacca 


ccac 


2514 



<210> 136 

<211> 320 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 136 

Met Ala Ser Met Thr Gly Gly Gin Gin Met Gly Arg He Asn Ser Gly 
15 10 15 

Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val Asp Gly 

20 25 30 

His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys Gly Leu Thr Thr 
35 40 45 

Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys Ser Leu 
50 55 60 

Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He Val Val Phe Asp 
65 70 75 80 

Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly Gly Tyr Lys Ala 
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85 



90 95 



Gly Arg Ala Pro Thr Leu Val Pro Arg Gly Ser Glu Asp Phe Pro Arg 
100 105 110 

Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg 
115 120 125 

Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala 
130 135 140 

Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp 
145 150 155 1^0 

Lys Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro 
165 170 175 

Glu Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu 
180 185 190 

Arg Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr. Gly Asp Glu Ser 
195 200 205 

Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys 
210 215 220 

Leu Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp 
225 230 235 240 

Arg Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp 
245 250 255 

Leu Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu 
260 265 270 

Glu Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg 
275 280 285 

Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly 
290 295 300 

Leu Leu Glu Ser Pro Lys Ala Ala Leu Glu His His His His His His 
305 310 315 320 

<210> 137 

<211> 73 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 137 

ccctatcttt aaagttttta aaaagtttga cccccttttg ggggccctat ctttaaagtt 60 



tttaaaaatt tga 



73 
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<210> 138 

<211> 15 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 138 

cgcgcggaac gcgcg 15 



<210> 139 

<211> 16 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 139 

cccgggtttt cccggg 16 



<210> 140 

<211> 20 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 



<400> 140 

aggcgcacca atttggtgtt 20 



<210> 141 

<211> 53 

<212> RNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 141 

uucgcuuucu ucccuuccuu ucucgccacg uucgccggcu uuccccguca age 53 



<210> 142 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
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<400> 142 3Q 
acggggaaag ccggcgaacg tggcgagaaa 



<210> 143 

<211> 26 

<212> DMA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 143 26 



attagaaagg aagggaagaa agcgaa 



<210> 144 

<211> 30 

<212> DNA . 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 144 



acggggaaag ccggcgaacg tggcgagaac 



<210> 145 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 145 

cttgacgggg aaagccggcg aacgtggcgc 



<210> 146 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400^ 146 23 
agaaaggaag ggaagaaagc gaa 



<210> 147 

<211> 63 

<212> DNA 

<213> Artificial 
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<220> 

<223> Synthetic 
<400> 147 

gcgcggcggc gggtgtggtg gttacgcgca gcgtgaccgc tacacttgcc agcgccctag 60 
cgc 63 



<210> 148 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 148 

gcgctagggc gctggcaagt gtagcggtca 

<210> 149 

<211> 35 

<212> DNA 

<213> Artificial" 

<220> 

<223> Synthetic 

<400> 149 

gatcgctgcg cgtaaccacc acacccgcgc cgcgc 



<210> 150 

<211> 30 

<212> DNA 

<213> Artificial 



30 



35 



<220> 

<223> Synthetic 
<400> 150 

ggactctgcc tcaagacggt agtcaacgtg 30 



<210> 151 

<211> 16 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 151 

cacgttgact accgtc 16 
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<210> 152 

<211> 34 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 152 

catgtcaagc agtcctaact ttgaggcaga gtcc 



<210> 153 

<211> 2506 

<212> DNA 

<213> Thermus aquaticus 



34 



60 



<400> 153 

atgaggggga tgctgcccct ctttgagccc aagggccggg tcctcctggt ggacggccac 
cacctggcct accgcacctt ccacgccctg aagggcctca ccaccagccg gggggagccg 120 
gtgcaggcgg tctacggctt cgccaagagc ctcctcaagg ccctcaagga ggacggggac 180 
gcggtgatcg tggtctttga cgccaaggcc ccctccttcc gccacgaggc ctacgggggg 
tacaaggcgg gccgggcccc cacgccggag gactttcccc ggcaactcgc cctcatcaag 
gagctggtgg acctcctggg gctggcgcgc ctcgaggtcc cgggctacga ggcggacgac 
gtcctggcca gcctggccaa gaaggcggaa aaggagggct acgaggtccg catcctcacc 
gccgacaaag acctttacca gctcctttcc gaccgcatcc acgtcctcca ccccgagggg 
tacctcatca ccccggcctg gctttgggaa aagtacggcc tgaggcccga ccagtgggcc 540 
gactaccggg ccctgaccgg ggacgagtcc gacaaccttc ccggggtcaa gggcatcggg 
gagaagacgg cgaggaagct tctggaggag tgggggagcc tggaagccct cctcaagaac 
ctggaccggc tgaagcccgc catccgggag aagatcctgg cccacatgga cgatctgaag 
ctctcctggg acctggccaa ggtgcgcacc gacctgcccc tggaggtgga cttcgccaaa 
aggcgggagc ccgaccggga gaggcttagg gcctttctgg agaggcttga gtttggcagc 
ctcctccacg agttcggcct tctggaaagc cccaaggccc tggaggaggc cccctggccc 
ccgccggaag gggccttcgt gggctttgtg ctttcccgca aggagcccat gtgggccgat 
cttctggccc tggccgccgc cagggggggc cgggtccacc gggcccccga gccttataaa 1020 
gccctcaggg acctgaagga ggcgcggggg cttctcgcca aagacctgag cgttctggcc 1080 
ctgagggaag gccttggcct cccgcccggc gacgacccca tgctcctcgc ctacctcctg 1140 
gacccttcca acaccacccc cgagggggtg gcccggcgct acggcgggga gtggacggag 1200 



240 
300 

360 
420 
480 



600 
660 
720 
780 
840 
900 
960 
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gaggcggggg 


agcgggccgc 


cctttccgag 


aggctcttcg 


ccaacctgtg ggggaggctt 


1260 


gagggggagg 


agaggctcct 


ttggctttac 


cgggaggtgg 


agaggcccct ttccgctgtc 


1320 


ctggcccaca 


tggaggccac 


gggggtgcgc 


ctggacgtgg 


cctatctcag ggccttgtcc 


1380 


ctggaggtgg 


ccgaggagat 


cgcccgcctc 


gaggccgagg 


tcttccgcct ggccggccac 


1440 


cccttcaacc 


tcaactcccg 


ggaccagctg 


gaaagggtcc 


tctttgacga gctagggctt 


1500 


cccgccatcg 


gcaagacgga 


gaagaccggc 


aagcgctcca 


ccagcgccgc cgtcctggag 


1560 


gccctccgcg 


aggcccaccc 


catcgtggag 


aagatcctgc 


agtaccggga gctcaccaag 


1620 


ctgaagagca 


cctacattga 


ccccttgccg 


gacctcatcc 


accccaggac gggccgcctc 


1680 


cacacccgct 


tcaaccagac 


ggccacggcc 


acgggcaggc 


taagtagctc cgatcccaac 


1740 


ctccagaaca 


tccccgtccg 


caccccgctt 


gggcagagga 


tccgccgggc cttcatcgcc 


1800 


gaggaggggt 


ggctattggt 


ggccctggac 


tatagccaga 


tagagctcag ggtgctggcc 


1860 


cacctctccg 


gcgacgagaa 


cctgatccgg 


gtcttccagg 


aggggcggga catccacacg 


1920 


gagaccgcca 


gctggatgtt 


cggcgtcccc 


cgggaggccg 


tggaccccct gatgcgccgg 


1980 


gcggccaaga 


ccatcaactt 


cggggtcctc 


tacggcatgt 


cggcccaccg cctctcccag 


2040 


gagctagcca 


tcccttacga 


ggaggcccag 


gccttcattg 


agcgctactt tcagagcttc 


2100 


cccaaggtgc 


gggcctggat 


tgagaagacc 


ctggaggagg 


gcaggaggcg ggggtacgtg 


2160 


gagaccctct 


tcggccgccg 


ccgctacgtg 


ccagacctag 


aggcccgggt gaagagcgtg 


2220 


cgggaggcgg 


ccgagcgcat 


ggccttcaac 


atgcccgtcc 


agggcaccgc cgccgacctc 


2280 


atgaagctgg 


ctatggtgaa 


gctcttcccc 


aggctggagg 


aaatgggggc caggatgctc 


2-340 


cttcaggtcc 


acgacgagct 


ggtcctcgag 


gccccaaaag 


agagggcgga ggccgtggcc 


2400 


cggctggcca 


aggaggtcat 


ggagggggtg 


tatcccctgg 


ccgtgcccct ggaggtggag 


2460 


gtggggatag 


gggaggactg 


gctctccgcc 


aaggagtgat 


accacc 


2506 



<210> 154 
<211> 2496 
<212> DNA 

<213> Thermus flavus 

<400> 154 

atggcgatgc ttcccctctt tgagcccaaa ggccgcgtgc tcctggtgga cggccaccac 60 
ctggcctacc gcaccttctt tgccctcaag ggcctcacca ccagccgcgg cgaacccgtt 120 
caggcggtct acggcttcgc caaaagcctc ctcaaggccc tgaaggagga cggggacgtg 180 
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gtggtggtgg tctttgacgc caaggccccc tccttccgcc acgaggccta cgaggcctac 
aaggcgggcc gggcccccac cccggaggac tttccccggc agctggccct catcaaggag 
ttggtggacc tcctaggcct tgtgcggctg gaggttcccg gctttgaggc ggacgacgtg 
ctggccaccc tggccaagcg ggcggaaaag gaggggtacg aggtgcgcat cctcactgcc 
gaccgcgacc tctaccagct cctttcggag cgcatcgcca tcctccaccc tgaggggtac 
ctgatcaccc cggcgtggct ttacgagaag tacggcctgc gcccggagca gtgggtggac 
taccgggccc tggcggggga cccctcggat aacatccccg gggtgaaggg catcggggag 
aagaccgccc agaggctcat ccgcgagtgg gggagcctgg aaaacctctt ccagcacctg 
gaccaggtga agccctcctt gcgggagaag ctccaggcgg gcatggaggc cctggccctt 
tcccggaagc tttcccaggt gcacactgac ctgcccctgg aggtggactt cgggaggcgc 
cgcacaccca acctggaggg tctgcgggct tttttggagc ggttggagtt tggaagcctc 
ctccacgagt tcggcctcct ggaggggccg aaggcggcag aggaggcccc ctggccccct 
ccggaagggg cttttttggg cttttccttt tcccgtcccg agcccatgtg ggccgagctt 
ctggccctgg ctggggcgtg ggaggggcgc ctccatcggg cacaagaccc ccttaggggc 
ctgagggacc ttaagggggt gcggggaatc ctggccaagg acctggcggt tttggccctg 
cgggagggcc tggacctctt cccagaggac gaccccatgc tcctggccta ccttctggac 1140 
ccctccaaca ccacccctga gggggtggcc cggcgttacg ggggggagtg gacggaggat 1200 
gcgggggaga gggccctcct ggccgagcgc ctcttccaga ccctaaagga gcgccttaag 
ggagaagaac gcctgctttg gctttacgag gaggtggaga agccgctttc ccgggtgttg 
gcccggatgg aggccacggg ggtccggctg gacgtggcct acctccaggc cctctccctg 
gaggtggagg cggaggtgcg ccagctggag gaggaggtct tccgcctggc cggccacccc 1440 
ttcaacctca actcccgcga ccagctggag cgggtgctct ttgacgagct gggcctgcct 1500 
gccatcggca agacggagaa gacggggaaa cgctccacca gcgctgccgt gctggaggcc 1560 
ctgcgagagg cccaccccat cgtggaccgc atcctgcagt accgggagct caccaagctc 1620 
aagaacacct acatagaccc cctgcccgcc ctggtccacc ccaagaccgg ccggctccac 1680 
acccgcttca accagacggc caccgccacg ggcaggcttt ccagctccga ccccaacctg 1740 
cagaacatcc ccgtgcgcac ccctctgggc cagcgcatcc gccgagcctt cgtggccgag 
gagggctggg tgctggtggt cttggactac agccagattg agcttcgggt cctggcccac 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



1260 
1320 
1380 



1800 
1860 
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ctctccgggg 


acgagaacct 


gatccgggtc 


tttcaggagg ggagggacat ccacacccag 


1920 


accgccagct 


ggatgttcgg 


cgtttccccc 


gaaggggtag accctctgat gcgccgggcg 


1980 


gccaagacca 


tcaacttcgg 


ggtgctctac 


ggcatgtccg cccaccgcct ctccggggag 


2040 


ctttccatcc 


cctacgagga 


ggcggtggcc 


ttcattgagc gctacttcca gagctacccc 


2100 


aaggtgcggg 


cctggattga 


ggggaccctc 


gaggagggcc gccggcgggg gtatgtggag 


2160 


accctcttcg 


gccgccggcg 


ctatgtgccc 


gacctcaacg cccgggtgaa gagcgtgcgc 


2220 


gaggcggcgg 


agcgcatggc 


cttcaacatg 


ccggtccagg gcaccgccgc cgacctcatg 


2280 


aagctggcca 


tggtgcggct 


tttcccccgg 


cttcaggaac tgggggcgag gatgcttttg 


2340 


caggtgcacg 


acgagctggt 


cctcgaggcc 


cccaaggacc gggcggagag ggtagccgct 


2400 


ttggccaagg 


aggtcatgga 


gggggtctgg 


cccctgcagg tgcccctgga ggtggaggtg 


2460 


ggcctggggg 


aggactggct 


ctccgccaag 


gagtag 


2496 



<210> 155 

<211> 2505 

<212> DNA 

<213> Thermus thermophilus 

<400> 155 



atggaggcga 


tgcttccgct 


ctttgaaccc 


aaaggccggg tcctcctggt ggacggccac 


60 


cacctggcct 


accgcacctt 


cttcgccctg 


aagggcctca ccacgagccg gggcgaaccg 


120 


gtgcaggcgg 


tctacggctt 


cgccaagagc 


ctcctcaagg ccctgaagga ggacgggtac 


180 


aaggccgtct 


tcgtggtctt 


tgacgccaag 


gccccctcct tccgccacga ggcctacgag 


240 


gcctacaagg 


cggggagggc 


cccgaccccc 


gaggacttcc cccggcagct cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggngtttacc 


cgcctcgagg tccccggcta cgaggcggac 


360 


gacgttctcg 


ccaccctggc 


caagaaggcg 


gaaaaggagg ggtacgaggt gcgcatcctc 


420 


accgccgacc 


gcgacctcta 


ccaactcgtc 


tccgaccgcg tcgccgtcct ccaccccgag 


480 


ggccacctca 


tcaccccgga 


gtggctttgg 


gagaagtacg gcctcaggcc ggagcagtgg 


540 


gtggacttcc 


gcgccctcgt 


gggggacccc 


tccgacaacc tccccggggt caagggcatc 


600 


ggggagaaga 


ccgccctcaa 


gctcctcaag 


gagtggggaa gcctggaaaa cctcctcaag 


660 


aacctggacc 


gggtaaagcc 


agaaaacgtc 


cgggagaaga tcaaggccca cctggaagac 


720 


ctcaggctct 


ccttggagct 


ctcccgggtg 


cgcaccgacc tccccctgga ggtggacctc 


780 


gcccaggggc 


gggagcccga 


ccgggagggg 


cttagggcct tcctggagag gctggagttc 


840 
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90 0 
960 



ggcagcctcc tccaogagtt cggcctcctg gaggcccccg cccccctgga ggaggccccc 
tggcccccgc cggaaggggc cttcgtgggc ttcgtcctct cccgccccga gcccatgtgg 

gcggagctta aagccctggc cgcctgcagg gacggccggg tgcaccgggc agcagacccc 1020 

ttggcggggc taaaggacct caaggaggtc cggggcctcc tcgccaagga cctcgccgtc 1080 

ttggcctcga gggaggggct agacotcgtg cccggggacg accccatgct cctcgcctac 1140 

ctcctggacc cctccaacac cacccccgag ggggtggcgc ggcgctacgg gggggagtgg 1200 

acggaggacg ccgcccaccg ggccctcctc tcggagaggc tccatcggaa cctccttaag 1260 

cgoctcgagg gggaggagaa gctcctttgg ctctaccacg aggtggaaaa gcccctctcc 1320 

cgggtcctgg cccacatgga ggccaccggg gtacggctgg acgtggccta ccttcaggcc 1380 

ctttccctgg agcttgcgga ggagatccgc cgcctogagg aggaggtctt ccgcttggcg 1440 

ggccacccct tcaacctcaa ctcccgggao cagctggaaa gggtgctctt tgacgagctt 1500 

aggcttcccg ccttggggaa gacgcaaaag acaggoaagc gctccaccag cgccgcggtg 1560 

ctggaggccc tacgggaggo ccaocccatc gtggagaaga tcetccagca cogggagctc 1620 

accaagctca agaacaccta cgtggacccc ctcccaagcc tcgtccaccc gaggacgggc 1680 

cgcctcoaoa cccgcttcaa ccagacggcc acggccacgg ggaggcttag tagctccgac 1740 

cccaacctgc agaacatccc cgtccgcacc cccttgggcc agaggatccg ccgggccttc 1800 

gtggccgagg cgggttgggc gttggtggcc ctggactata gccagataga gctccgcgtc 1860 

ctcgcccacc tctccgggga cgaaaacctg atcagggtct tccaggaggg gaaggacatc 1920 

cacacccaga ccgcaagctg gatgttcggc gtccccccgg aggccgtgga ccccctgatg 1980 

ogccgggcgg ccaagacggt gaacttcggc gtcctctacg gcatgtccgc ccataggctc 2040 

tcccaggagc ttgccatccc ctacgaggag gcggtggcct ttatagagcg ctacttccaa 2100 

agcttcccca aggtgcgggc ctggatagaa aagaccctgg aggaggggag gaagcggggc 2160 

tacgtggaaa ccctcttcgg aagaaggcgc tacgtgcccg acctcaacgc ccgggtgaag 2220 

agcgtcaggg aggccgcgga gcgcatggcc ttcaacatgc ocgtccaggg caccgccgcc 2280 

gacctcatga agctcgccat ggtgaagctc ttcccccgcc tccgggagat gggggcccgc 2340 

atgctcctcc aggtccacga cgagctcctc ctggaggccc cccaagcgcg ggccgaggag 2400 

gtggcggctt tggccaagga ggccatggag aaggcctatc ccctcgccgt gcccctggag 2460 

gtggaggtgg ggatggggga ggactggctt tccgccaagg gttag 2505 
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<210> 156 

<211> 2502 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<220> 

<221> n 

<222> (1) . . (2502) 

<223> n is any of the bases g, a, t, or c 



<400> 156 
atgnnggcga 


tgcttcccct 


ctttgagccc 


aaaggccggg tcctcctggt 


ggacggccac 


60 


cacctggcct 


accgcacctt 


cttcgccctg 


aagggcctca ccaccagccg 


gggcgaaccg 


120 


gtgcaggcgg 


tctacggctt 


cgccaagagc 


ctcctcaagg ccctgaagga 


ggacggggac 


180 


imggcggtgn 


tcgtggtctt 


tgacgccaag 


gccccctcct tccgccacga 


ggcctacgag 


240 


gcctacaagg 


cgggccgggc 


ccccaccccg 


gaggactttc cccggcagct 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggcttgcg 


cgcctcgagg tccccggcta 


cgaggcggac 


360 


gacgtnctgg 


ccaccctggc 


caagaaggcg 


gaaaaggagg ggtacgaggt 


gcgcatcctc 


420 


accgccgacc 


gcgacctcta 


ccagctcctt 


tccgaccgca tcgccgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


gtggctttgg 


gagaagtacg gcctgaggcc 


ggagcagtgg 


540 


gtggactacc 


gggccctggc 


gggggacccc 


tccgacaacc tccccggggt 


caagggcatc 


600 


ggggagaaga 


ccgcccngaa 


gctcctcnag 


gagtggggga gcctggaaaa 


cctcctcaag 


660 


aacctggacc 


gggtgaagcc 


cgccntccgg 


gagaagatcc aggcccacat 


ggangacctg 


720 


angctctcct 


gggagctntc 


ccaggtgcgc 


accgacctgc ccctggaggt 


ggacttcgcc 


780 


aagnggcggg 


agcccgaccg 


ggaggggctt 


agggcctttc tggagaggct 


ggagtttggc 


840 


agcctcctcc 


acgagttcgg 


cctcctggag 


ggccccaagg ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtcctttccc gccccgagcc 


catgtgggcc 


960 


gagcttctgg 


ccctggccgc 


cgccagggag 


ggccgggtcc accgggcacc 


agaccccttt 


1020 


angggcctna 


gggacctnaa 


ggaggtgcgg 


ggnctcctcg ccaaggacct 


ggccgttttg 


1080 


gccctgaggg 


agggcctnga 


cctcntgccc 


ggggacgacc ccatgctcct 


cgcctacctc 


1140 


ctggacccct 


ccaacaccac 


ccccgagggg 


gtggcccggc gctacggggg 


ggagtggacg 


1200 


gaggangcgg 


gggagcgggc 


cctcctntcc 


gagaggctct tccngaacct 


nnngcagcgc 


1260 
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cttgaggggg aggagaggct cctttggctt taccaggagg tggagaagcc cctttcccgg 1320 

gtcctggccc acatggaggc cacgggggtn cggctggacg tggcctacct ccaggccctn 1380 

tccctggagg tggcggagga gatccgccgo ctcgaggagg aggtcttccg cctggccggc 1440 

caccccttca acctcaactc ccgggaccag ctggaaaggg tgctctttga cgagctnggg 1500 

cttcccgcca tcggcaagac ggagaagacn ggcaagcgct ccaccagcgc cgccgtgctg 1560 

gaggccctnc gngaggccca ccccatcgtg gagaagatcc tgcagtaccg ggagctcacc 1620 

aagctcaaga acacctacat ngaccccctg ccngncctcg tocaocccag gacgggcogc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggca ggcttagtag ctccgacccc 1740 

aacctgcaga acatccccgt ccgcaccccn ctgggccaga ggatccgccg ggccttcgtg 1800 

gccgaggagg gntgggtgtt ggtggccctg gactatagcc agatagagot ccgggtcctg i860 

gcccacctct ccggggacga gaacctgatc cgggtcttoc aggaggggag ggacatccac 1920 

acccagaccg ccagotggat gttcggcgtc cccccggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agaccatoaa cttcggggtc ctctacggca tgtccgccca ccgcctctcc 2040 

caggagcttg ccatccccta cgaggaggcg gtggccttca ttgagcgcta cttccagagc 2100 

ttccccaagg tgcgggcctg gattgagaag accctggagg agggcaggag gcgggggtac 2160 

gtggagaccc tcttcggccg ccggcgctac gtgcccgacc tcaacgcccg ggtgaagagc 2220 

gtgcgggagg cggcggagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tggccatggt gaagctcttc ccccggctnc aggaaatggg ggccaggatg 2340 

ctcctncagg tccacgacga gctggtcctc gaggccccca aagagcgggc ggaggnggtg 2400 

gccgctttgg ccaaggaggt catggagggg gtctatcccc tggccgtgcc cctggaggtg 2460 
gaggtgggga tgggggagga ctggctctcc gccaaggagt ag 

<210> 157 
<211> 832 
<212> PRT 

<213> Therraus aguaticus 
<400> 157 

Met Arg Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu 
15 10 15 

Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys Gly 
2'0 25 30 

Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala 



2502 
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35 



40 



45 



Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He Val 
50 55 60 

Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly Gly 
65 70 75 80 

Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu 
85 90 95 

Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu Glu 
100 105 110 

Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys Lys 
115 120 125 

Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys Asp 
130 135 140 

Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu Gly 
145 150 155 160 

Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro 
165 170 175 

Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp Asn 
180 185 190 

Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu Leu 
195 200 205 

Glu Glu Trp Gly Ser ^ Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu 
210 215 220 

Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu Lys 
225 230 235 240 

Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu Val 
245 250 255 

Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala Phe 
260 265 270 

Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu 
275 280 285 

Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 
290 295 300 

Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala Asp 
305 310 315 320 

Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala Pro 
325 330 335 



Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu Leu 
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340 



345 



350 



Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu Pro 

355 360 365 

Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 
370 375 380 

Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu 
385 390 395 400 

Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn Leu 
405 410 415 

Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg Glu 
420 425 430 

Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr Gly 
435 440 445 

Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val Ala 
450 455 460 

Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly His 
465 470 475 480 

Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp 
485 490 495 ' 

Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys Arg 
500 505 510 

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro He 
515 520 525 

Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser Thr 
530 535 540 

Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg Leu 
545 550 555 560 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 570 575 

Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin 
580 585 590 

Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val Ala 
595 600 605 

Leu Aep Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 615 620 

Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His Thr 
625 630 635 640 

Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp Pro 
645 650 655 
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Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr Gly 
660 665 670 

Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu Glu 
675 680 685 

Ala Glu Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg 
690 695 700 

Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr Val 
705 710 715 720 

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala Arg 
725 730 735 

Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 
740 745 750 

Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu 
755 760 765 

Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val His 
770 775 780 

Asp Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val Ala 
785 790 795 800 

Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val Pro 
805 810 815 

Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys Glu 
820 825 830 

<210> 158 
<211> 831 
<212> PRT 

<213> Thermus flavus 
<400> 158 

Met Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
15 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
20 25 30 

Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Val Val Val Val Val 
50 55 60 

Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
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85 



90 



95 



Leu He I.ys Glu Leu Val Asp Leu Leu Gly Leu Val Arg Leu Glu Val 
100 105 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Arg Ala 
X15 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Arg Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Glu Arg He Ala He Leu His Pro Glu Gly Tyr 
^ 155 160 



145 



150 



Leu He Thr Pro Ala Trp Leu Tyr Glu Lys Tyr Gly Leu Arg Pro Glu 
165 170 175 

Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser Asp Asn lie 
180 185 190 

Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Gin Arg Leu He Arg 



195 



200 



205 



Glu Trp Gly Ser Leu Glu Asn Leu Phe Gin His Leu Asp Gin Val Lys 
210 215 220 

Pro Ser Leu Arg Glu Lys Leu Gin Ala Gly Met Glu Ala Leu Ala Leu 
225 230 235 240 

Ser Arg Lys Leu Ser Gin Val His Thr Asp Leu Pro Leu Glu Val Asp 
245 250 255 

Phe Gly Arg Arg Arg Thr Pro Asn Leu Glu Gly Leu Arg Ala Phe Leu 
260 265 270 

Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu Glu 
275 280 285 

Gly Pro Lys Ala Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly Ala 
290 295 300 

Phe Leu Gly Phe Ser Phe Ser Arg Pro Glu Pro Met Trp Ala Glu Leu 
305 310 315 320 

Leu Ala Leu Ala Gly Ala Trp Glu Gly Arg Leu His Arg Ala Gin Asp 
325 330 335 

Pro Leu Arg Gly Leu Arg Asp Leu Lys Gly Val Arg Gly He Leu Ala 
340 345 350 

Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly Leu Asp Leu Phe Pro 
355 360 365 

Glu Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn Thr 
370 375 380 

Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu Asp 
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385 



390 



395 



400 



Ala Gly Glu Arg Ala Leu Leu Ala Glu Arg Leu Phe Gin Thr Leu Lys 
405 410 415 

Glu Arg Leu Lys Gly Glu Glu Arg Leu Leu Trp Leu Tyr Glu Glu Val 
420 425 430 

Glu Lys Pro Leu Ser Arg Val Leu Ala Arg Met Glu Ala Thr Gly Val 
435 440 445 

Arg Leu Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Val Glu Ala 
450 455 460 

Glu Val Arg Gin Leu Glu Glu Glu Val Phe Arg Leu Ala Gly His Pro 
465 470 475 480 

Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp Glu 
485 490 495 

Leu Gly Leu Pro Ala lie Gly Lys Thr Glu Lys Thr Gly Lys Arg Ser 
500 505 510 

Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro He Val 
515 520 525 

Asp Arg He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Asn Thr Tyr 
530 535 540 

He Asp Pro Leu Pro Ala Leu Val His Pro Lys Thr Gly Arg Leu His 
545 550 555 560 

Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser 

565 570 575 

Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin Arg 
580 585 590 

He Arg Arg Ala Phe Val Ala Glu Glu Gly Trp Val Leu Val Val Leu 
595 600 605 

Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly Asp 
610 615 620 

Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His Thr Gin 
625 630 635 640 

Thr Ala Ser Trp Met Phe Gly Val Ser Pro Glu Gly Val Asp Pro Leu 
645 650 655 

Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr Gly Met 
660 665 670 

Ser Ala His Arg Leu Ser Gly Glu Leu Ser He Pro Tyr Glu Glu Ala 
675 680 685 



Val Ala Phe He Glu Arg Tyr Phe Gin Ser Tyr Pro Lys Val Arg Ala 
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690 695 700 

Trp He Glu Gly Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr Val Glu 



705 710 715 

Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala Arg Val 



725 



730 



Lvs Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro Val 
^ 745 750 



740 

Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Arg Leu Phe 
755 760 765 

Pro Arg Leu Gin Glu Leu Gly Ala Arg Met Leu Leu Gin Val His Asp 
770 775 780 

Glu Leu Val Leu Glu Ala Pro Lys Asp Arg Ala Glu Arg Val Ala Ala 
785 790 795 800 

Leu Ala Lys Glu Val Met Glu Gly Val Trp Pro Leu Gin Val Pro Leu 
805 810 

Glu Val Glu Val Gly Leu Gly Glu Asp Trp Leu Ser Ala Lys Glu 
820 825 830 



<210> 159 

<211> 833 

<212> PRT 

<213> Artificial 



<220> 

<223> Synthetic 

<:220> 

<221> X 

<222> (1),.(833) 

<223> X is any amino acid 



<400> 159 

Met Xaa Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu 
15 10 15 

Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly 
20 25 30 

Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala 
35 40 45 

Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val Xaa Val 
50 55 ^0 

Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu Ala 

75 80 



65 



70 
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Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu 
85 90 95 

Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Xaa Arg Leu Glu 
100 105 110 

Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Tbr Leu Ala Lys Lys 
115 120 125 

Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Arg Asp 
130 135 140 

Leu Tyr Gin Leu Leu Ser Asp Arg lie Ala Val Leu His Pro Glu Gly 
145 150 155 160 

Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro 
165 170 175 

Glu Gin Trp Val Asp Tyr Arg Ala Leu Xaa Gly Asp Pro Ser Asp Asn 
180 185 190 

Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Xaa Lys Leu Leu 
195 200 205 

Xaa Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu Asp Arg Val 
210 215 220 

Lys Pro Xaa Xaa Arg Glu Lys He Xaa Ala His Net Glu Asp Leu Xaa 
225 230 235 240 

Leu Ser Xaa Xaa Leu Ser Xaa Val Arg Thr Asp Leu Pro Leu Glu Val 
245 250 255 

Asp Phe Ala Xaa Arg Arg Glu Pro Asp Arg Glu Gly Leu Arg Ala Phe 
260 265 270 

Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu 
275 280 285 

Glu Xaa Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 
290 295 300 

Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu 
305 310 315 320 

Leu Leu Ala Leu Ala Ala Ala Arg Xaa Gly Arg Val His Arg Ala Xaa 
325 330 335 

Asp Pro Leu Xaa Gly Leu Arg Asp Leu Lys Glu Val Arg Gly Leu Leu 
340 345 350 

Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly Leu Asp Leu Xaa 
355 360 365 



Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 
370 375 380 
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Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu 

395 400 



385 



390 



Asp Ala Gly Glu Arg Ala Leu Leu Ser Glu Arg Leu Phe Xaa Asn Leu 
405 410 415 

Xaa Xaa Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Xaa Glu 
420 425 430 

Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr Gly 



Lys 
435 



440 



445 



val Arg Leu Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Val Ala 
450 455 460 

Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly His 

475 480 



465 



470 



Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp 

485 490 495 

Glu Leu Gly Leu Pro Ala He Gly Lys Thr Glu Lys Thr Gly Lys Arg 

500 505 510 

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro He 

515 ' 520 525 

Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Asn Thr 



530 



535 



540 



Tyr He Asp Pro Leu Pro Xaa Leu Val His Pro Arg Thr Gly Arg Leu 
545 550 555 560 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 570 575 

Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin 



580 



585 



590 



Arg He Arg Arg Ala Phe Val Ala Glu Glu Gly Trp Xaa Leu Val Ala 
595 600 605 

Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 615 620 

Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His Thr 
625 630 635 640 

Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp Pro 
645 650 655 

Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr Gly 
660 665 670 

Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu Glu 
675 680 685 



461 



wo 01/90337 



PCTAJSOl/17086 



Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg 
690 695 700 

Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr Val 
705 710 715 720 

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala Arg 
725 730 735 

Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 
740 745 750 

Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu 
755 760 765 

Phe Pro Arg Leu Xaa Glu Met Gly Ala Arg Met Leu Leu Gin Val His 
770 775 780 

Asp Glu Leu Val Leu Glu Ala Pro Lys Xaa Arg Ala Glu Xaa Val Ala 
785 790 795 800 

Ala Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val Pro 
805 810 815 

Leu Glu Val Glu Val Gly Xaa Gly Glu Asp Trp Leu Ser Ala Lys Glu 
820 825 830 

Xaa 



<210> 160 

<2ia> 640 

<212> RNA 

<213> Artificial 

<220>' 

<223> Synthetic 
<400> 160 

gggagcccag cuaugaacuc cuucuccaca agcgccuucg guccaguugc cuucucccug 60 

gggcugcucc ugguguugcc ugcugccuuc ccugccccag uacccccagg agaagauucc 120 

aaagauguag ccgccccaca cagacagcca cucaccucuu cagaacgaau ugacaaacaa 180 

auucgguaca uccucgacgg caucucagcc cugagaaagg agacauguaa caagaguaac 240 

augugugaaa gcagcaaaga ggcacuggca gaaaacaacc ugaaccuucc aaagauggcu 300 

gaaaaagaug gaugcuucca aucuggauuc aaugaggaga cuugccuggu gaaaaucauc 360 

acuggucuuu uggaguuuga gguauaccua gaguaccucc agaacagauu ugagaguagu 420 

gaggaacaag ccagagcugu ccagaugagu acaaaagucc ugauccaguu ccugcagaaa 480 

aaggcaaaga aucuagaugc aauaaccacc ccugacccaa ccacaaaugc cagccugcug 540 
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acgaagcugc aggcacagaa ccaguggcug caggacauga caacucaucu cauucugcgc 600 
agcuuuaagg aguuccugca guccagccug agggcucuuc 



640 



<210> 161 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 161 27 
agggagaagg caactggacc gaaggcc 

<210> 162 

<211> 35 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<220> 

<221> n 

<222> (1)..(1) 

<223> TH 5» end has a fluorescein label. 



<220> 

<221> n 

<222> (35).. (35) 

<223> TH 3' end is modified with a dideoxynucleotide . 



<400> 162 

ncgaaattaa tacgcttgtg gagaaggagt tcatn 



<210> 163 

<211> 53 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 163 

gctatgaact ccttctccac aagcgccttc ggtccagttg ccttctccct ggg 



<210> 164 

<211> 214 

<212> PRT 

<213> Artificial 



463 



wo 01/90337 



PCT/USOl/17086 



<220> 

<223> Synthetic 
<400> 164 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
15 10 15 

Glu Trp Thr Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu 
20 25 30 

Phe Ala Asn Leu Trp Gly Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp 
35 40 45 

Leu Tyr Arg Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met 
50 55 60 

Glu Ala Thr Gly Val Arg Leu Asp Val Ala Tyr Leu Arg Ala Leu Ser 
65 70 75 80 

Leu Glu Val Ala Glu Glu lie Ala Arg Leu Glu Ala Glu Val Phe Arg 
85 90 95 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
100 105 110 

Val Leu Phe Asp Glu Leu Gly Leu Pro Ala lie Gly Lys Thr Glu Lys 
115 120 125 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
130 135 140 

Ala His Pro lie Val Glu Lys lie Leu Gin Tyr Tlrg Glu Leu Thr Lys 
145 150 155 160 

Leu Lys Ser Thr Tyr lie Asp Pro Leu Pro Asp Leu lie His Pro Arg 
165 170 175 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
180 185 190 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
195 200 205 

Pro Leu Gly Gin Arg lie 
210 

<210> 165 

<211> 214 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 165 
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Asp Pro Ser Asn Thr Thr Pro Glu Qly Val Ala Arg Arg Tyr Gly Gly 
15 10 15 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
20 25 30 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
35 40 45 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
50 55 60 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
65 70 75 80 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
85 90 95 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
100 105 110 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Gly Lys Thr Gin Lys 
115 120 125 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
130 135 140 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
145 150 155 160 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
165 170 175 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
180 185 190 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
195 200 205 

Pro Leu Gly Gin Arg He 
210 

<210> 166 

<211> 37 

<212> RNA 

<213> Artificial 

<220> 

<223> Synthetic 

<220> 

<221> n 

<222> (1)..(1) 

<223> TH 5' end is modified with a biotin-streptavitin complex. 
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<400> 166 

nucacggcag uuggugcgcc ucggaacgag gcgcacg 37 



<210> 167 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<220> 

<221> n 

<222> {1)..{1) 

<223> TH 5» end is labeled with tetrachlororf luorescein. 

<400> 167 

nttttcaact gccgtga 17 



<210> 168 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 168 

tcacggcagt tggtgcgcct cggaacgagg cgcacg 36 



<210> 169 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 169 

cgccgagatc acctttacat tttctatcgt 



<210> 170 

<211> 24 

<212> DNA 

<213> Artificial 



30 



<220> 

<223> Synthetic 
<400> 170 

ccttccttat cctggatctt ggca 24 
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<210> 171 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 171 

acgatagaaa atgtaaaggt gate 



<210> 172 

<211> 15 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<220> 

<221> n 

<222> (4).. (4) 

<223> The residue at this position is a z28 linker group. 



<400> 172 
ctcnttctca gtgcg 



<210> 173 

<211> 29 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 173 

cgcagtgaga atgaggtgat ctcggcggt 



<210> 174 

<211> 31 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 174 

ccgccgagat cacggatgtt gtaatcagag a 



<210> 175 
<211> 21 
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<212> DNA 

<:213> Artificial 

<220> 

<223> Synthetic 
<400> 175 

gtgcagggtt gactccttct c 21 



<210> 176 

<211> 21 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 



<400> 176 

gtgcagggtt gactctttct c 21 



<210> 177 

<211> 21 

<212> mh 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 177 

gtgcagggtc gactctttct c 21 



<210> 178 

<211> 25 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 



<400> 178 

tctctgatta caacatccgt gatct 25 



<210> 179 

<211> 29 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 179 

cgccgagatc acgtagttga ggtcaatga 29 
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<210> 180 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 180 

gaatcatact ggaacatgta gaccatc 



<210> 181 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 181 

tcattgacct caactacgtg atct 



<210> 182 

<211> 28 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 182 

ccgccgagat cacgatgatc ttgaggct 



<210> 183 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 183 

tggtgcagga ggcattgctc 



<210> 184 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<;223> Synthetic 
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<400> 184 

cagcctcaag attaccgtga tct 23 



<210> 185 

<211> 23 , 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 185 

ccgtcacgcc tcctccacgg etc 23 



<210> 186 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 186 

aggcgaaagc cctcaatttc cca 23 



<210> 187 

<211> 15 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 187 

aaccactgcc gcaca 15 



<210> 188 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 188 

gagccgtgga ggaggcg 17 



<210> 189 

<211> 14 

<212> DNA 

<213> Artificial 
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<220> 

<223> Synthetic 

<220> 

<221> n 

<222> (4) . . (4) 

<223> The' residue at this position is a z28 linker group. 



<400> 189 
cacntgcttc gtgg 



<210> 190 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 190 

ccaggaagca agtggaggcg tgacggt 



<210> 191 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 191 

ccgtcacgcc tccttcggag tttggg 



<210> 192 

<211> 25 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 192 

gggttgtgga gtgagtgttc aagta 



<21Q> 193 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
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<400> 193 

gggaaactcc gaaggaggcg 20 



<210> 194 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 194 

ccgtcacgcc tctctgactg cca 23 



<210> 195 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 195 

ttgtcactcg gggttcgaga agatgaa 27 



<210> 196 

<211> 11 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 196 

gggccagagg ^ 11 

<210> 197 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 197 

aggcagtcag agaggcg 17 



<210> 198 

<211> 26 

<212> DNA 

<213> Artificial 
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245 250 255 

Leu Leu Gin Val Asp Phe Ala Arg Arg Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Lys Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ser Pro Val Ala Ala Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Tyr Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Asn Ala Leu Ala Ala Ala Trp Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 

355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu S^r 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 530 
535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 



473 



wo 01/90337 



PCTAJSOl/17086 



545 



550 



555 



560 



Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 

740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 8X0 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 



<210> 473 
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<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
I. 

<400> 473 

aagccactcc ggggtgatca ggtaacc 



<210> 474 

<211> 27 

<212> DNA 

<213> , Artificial 

<220> 

<223> Synthetic 

<400> 474 

atcaccccgg agtggctttg ggagaag 



<210> 475 

<211> 2514 

<212i> DNA 

<213> Artificial 



<220> 

<223> Synthetic 



<400> 475 
atgaattccc 


tgcccctctt tgagcccaag 


ggccgggtgc ttctggtgga cggccaccac 


60 


ctggcctacc 


gtaccttttt tgccctgaag 


ggcctcacca ccagccgcgg ggagccggtc 


120 


caggcggtgt 


acgggtttgc caagagcctt 


ttgaaggcgc taagggaaga cggggatgtg 


180 


gtgatcgtgg 


tctttgacgc cgaggccccc 


tccttccgcc accagaccta cgaggcctac 


240 


aaggcggggc 


gggctcccac ccccgaggac 


tttccccggc agcttgccct tatcaaggag 


300 


atggtggacc 


ttttgggcct ggagcgcctc 


gaggtgccgg gctttgaagc ggatgacgtc 


360 


ctggctaccc 


tggccaagaa ggcggaaaag 


gaaggctacg aagtgcgcat cctcaccgcg 


420 


gaccgggacc 


tttaccagct tctttcggag 


cgaatctcca tccttcaccc ggagggttac 


480 


ctgatcaccc 


cggagtggct ttgggagaag 


tacggcctca ggccggagca gtgggtggac 


540 


ttccgcgccc 


tcgtggggga cccctccgac 


aacctccccg gggtcaaggg catcggggag 


600 


aagaccgccc 


tcaagctcct caaggagtgg 


ggaagcctgg aaaacctcct caagaacctg 


660 


gaccgggtaa 


agccagaaaa cgtccgggag 


aagatcaagg cccacctgga agacctcagg 


720 


ctctccttgg 


agctctcccg ggtgcgcacc 


gacctccccc tggaggtgga cctcgcccag 


780 
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gggcgggagc 


ccgaccggga 


ggggcttagg 


gccttcctgg 


agaggctgga 


gttcggcagc 


840 


ctcctccacg 


agttcggcct 


cctggaggcc 


cccgcccccc 


tggaggaggc 


cccctggccc 


900 


ccgccggaag 


gggccttcgt 


gggcttcgtc 


ctctcccgcc 


ccgagcccat 


gtgggcggag 


960 


cttaaagccc 


tggccgcctg 


caggggcggc 


cgcgtgcacc 


gggcagcaga 


ccccttggcg 




gggctaaagg 


acctcaagga 


ggtccggggc 


ctcctcgcca 


aggacctcgc 


cgtcttggcc 


1080 


tcgagggagg 


ggctagacct 


cgtgcccggg 


gacgacccca 


tgctcctcgc 


ctacctcctg 


1140 


gacccttcga 


acaccacccc 


cgagggggtg 


gcgcggcgct 


acggggggga 


gtggacggag 


1200 


gacgccgccc 


accgggccct 


cctctcggag 


aggctccatc 


ggaacctcct 


taagcgcctc 


1260 


gagggggagg 


agaagctcct 


ttggctctac 


cacgaggtgg 


aaaagcccct 


ctcccgggtc 


1320 


ctggcccata 


tggaggccac 


cggggtacgg 


cgggacgtgg 


cctaccttca 


ggccctttcc 


1380 


ctggagcttg 


cggaggagat 


ccgccgcctc 


gaggaggagg 


tcttccgctt 


ggcgggccac 


1440 


cccttcaacc 


tcaactcccg 


ggaccagctg 


gaaagggtgc 


tctttgacga 


gcttaggctt 


1500 


cccgccttga 


agaagacgaa 


gaagacaggc 


aagcgctcca 


ccagcgccgc 


ggtgctggag 


1560 


gccctacggg 


aggcccaccc 


catcgtggag 


aagatcctcc 


agcaccggga 


gctcaccaag 


1620 


ctcaagaaca 


cctacgtgga 


ccccctccca 


^gcctcgtcc 


acccgaggac 


gggccgcctc 


1680 


cacacccgct 


tcaaccagac 


ggccacggcc 


acggggaggc 


ttagtagctc 


cgaccccaac 


1740 


ctgcagaaca 


tccccgtccg 


cacccccttg 


ggccagagga 


tccgccgggc 


cttcgtggcc 


1800 


gaggcgggtt 


gggcgttggt 


ggccctggac 


tatagccaga 


tagagctccg 


cgtcctcgcc 


1860 


cacctctccg 


gggacgaaaa 


cctgatcagg 


gtcttccagg 


aggggaagga 


catccacacc 


1920 


cagaccgcaa 


gctggatgtt 


cggcgtcccc 


ccggaggccg 


tggaccccct 


gatgcgccgg 


1980 


gcggccaaga 


cggtgaactt 


cggcgtcctc 


tacggcatgt 


ccgcccatag 


gctctcccag 


2040 


gagcttgcca 


tcccctacga 


ggaggcggtg 


gcctttatag 


agcgctactt 


ccaaagcttc 


2100 


cccaaggtgc 


gggcctggat 


agaaaagacc 


ctggaggagg 


ggaggaagcg 


gggctacgtg 


2160 


gaaaccctct 


tcggaagaag 


gcgctacgtg 


cccgacctca 


acgcccgggt 


gaagagcgtc 


2220 


agggaggccg 


cggagcgcat 


ggccttcaac 


atgcccgtcc 


agggcaccgc 


cgccgacctc 


2260 


atgaagctcg 


ccatggtgaa 


gctcttcccc 


cgcctccggg 


agatgggggc 


ccgcatgctc 


2340 


ctccaggtcg 


ccaacgagct 


cctcctggag 


gccccccaag 


cgcgggccga 


ggaggtggcg 


2400 


gctttggcca 


aggaggccat 


ggagaaggcc 


tatcccctcg 


ccgtgcccct 


ggaggtggag 


2460 
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gtggggatgg gggaggactg gctttccgcc aagggtcacc accaccacca ccac 



<210> 476 

<211> 838 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 476 

Met Asn Ser Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 

1 5 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
20 25 30 

Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val He Val Val 
50 55 60 

Phe Asp Ala Glu Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu He Lys Glu Met Val Asp Leu Leu Gly Leu Glu Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Arg Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Glu Arg He Ser He Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro Glu 
165 170 175 

Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro Ser Asp Asn Leu 
180 185 190 

Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu Lys Leu Leu Lys 
195 200 205 

Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu Asp Arg Val Lys 
210 215 220 

Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu Glu Asp Leu Arg 
225 230 235 240 



2514 
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Leu Ser Leu Qlu Leu Ser Arg Val Arg Thr Asp Leu Pro Leu Glu Val 
245 250 255 

Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly Leu Arg Ala Phe 
260 265 270 

Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu L6u 
275 280 285 

Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro Pro Pro Qlu Gly 
290 295 300 

Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu 
305 310 315 320 

Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val His Arg Ala Ala 

325 330 335 , 

Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu Leu 
340 345 350 

Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu Val 
355 360 365 

Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 
370 375 380 

Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu 
385 390 395 400 

Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn Leu 
405 410 415 

Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His Glu 
420 425 430 

Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr Gly 
435 440 445 

Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu Ala 
450 455 460 

Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly His 
465 470 475 480 

Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp 
485 490 495 

Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys Arg 
500 • 505 510 

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro He 
515 520 525 



Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn Thr 
530 535 540 
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Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg Leu 
545 550 555 560 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 570 575 

Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin 



580 



585 



590 



Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val Ala 
595 600 605 

Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 615 620 

Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He His Thr 
625 630 635 640 

Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp Pro 
645 650 655 

Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr Gly 
660 665 670 

Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu Glu 
675 680 685 

Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg 
690 695 700 

Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr Val 
705 710 715 720 

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala Arg 
725 730 735 

Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 
740 745 750 

Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu 
755 760 765 

Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val Ala 
770 775 780 

Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val Ala 
785 790 795 800 

Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val Pro 
805 810 815 

Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys Gly 
820 825 830 



His His His His His His 
835 
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<210> 477 

<211> 64 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 477 

caccaccacc accaccacgt cgactagtgc tagcgtcgac tagctgcagg catgcaagct 60 
tggc 64 



<210> 


478 


<211> 


64 


<212> 


DNA ' 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


478 



gccaagcttg catgcctgca gctagtcgac gctagcacta gtcgacgtgg tggtggtggt 60 
ggtg 64 



<210> 


479 


<211> 


41 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


479 



caggaagcgg ccgcgtcgac atgaccatga ttacgccaag c 41 



<210> 480 

<211> 42 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 480 

gggcccgcca gggtcgactc agggcgatgg cccactacgt ga 42 



<210> 481 

<211> 3135 

<212> DNA 

<213> Artificial 
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<220> 

<223> Synthetic 
<400> 481 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 

ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120 

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 

gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240 

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 300 

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360 

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 420 

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 480 

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 540 

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 600 

ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660 

ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 720 

gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 780 

gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 840 

gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 900 

gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960 

atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtgca ccgggcagca 1020 

gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080 

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140 

gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 1200 

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 1260 

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 1320 

ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 1380 

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500 

gagcttaggc ttcccgcctt gaagaagacg aagaagacag gcaagcgctc caccagcgcc 1560 
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gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 2520 

caccacgtcg acatgaccat gattacgcca agctatttag gtgacactat agaatactca 2580 

agctatgcat caagcttggt accgagctcg gatccactag taacggccgc cagtgtgctg 2640 

gaattctgca gatatccatc acactggcgg ccgctcgagc atgcatctag agggcccaat 2700 

tcgccctata gtgagtcgta ttacaattca ctggccgtcg ttttacaacg tcgtgactgg 2760 

gaaaaccctg gcgttaccca acttaatcgc cttgcagcac atcccccttt cgccagctgg 2820 

cgtaatagcg aagaggcccg caccgatcgc ccttcccaac agttgcgcag cctgaatggc 2880 

gaatgggacg cgccctgtag cggcgcatta agcgcggcgg gtgtggtggt tacgcgcagc 2940 

gtgaccgcta cacttgccag cgccctagcg cccgctcctt tcgctttctt cccttccttt 3000 

ctcgccacgt tcgccggctt tccccgtcaa gctctaaatc gggggctccc tttagggttc 3060 

cgatttagag ctttacggca cctcgaccgc aaaaaacttg atttgggtga tggttcacgt 3120 

agtgggccat cgccc 3135 

<210> 482 
<211> 1045 



482 
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<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 482 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
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260 



265 



270 



Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 

500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 



Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
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565 



570 



575 



Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His Val Asp Met Thr Met He 
835 840 845 

Thr Pro Ser Tyr Leu Gly Asp Thr He Glu Tyr Ser Ser Tyr Ala Ser 
850 855 860 



Ser Leu Val Pro Ser Ser Asp Pro Leu Val Thr Ala Ala Ser Val Leu 
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865 870 875 880 

Glu Phe Cys Arg Tyr Pro Ser His Trp Arg Pro Leu Glu His Ala Ser 
885 890 895 

Arg Gly Pro Asn Ser Pro Tyr Ser Glu Ser Tyr Tyr Asn Ser Leu Ala 
900 905 910 

Val Val Leu Gin Arg Arg Asp Trp Glu Asn Pro Gly Val Thr Gin Leu 
915 920 925 

Asn Arg Leu Ala Ala His Pro Pro Phe Ala Ser Trp T^g Asn Ser Glu 
930 935 940 

Glu Ala Arg Thr Asp Arg Pro Ser Gin Gin Leu Arg Ser Leu J^n Gly 
945 950 955 960 

Glu Trp Asp Ala Pro Cys Ser Gly Ala Leu Ser Ala Ala Gly Val Val 
965 970 975 

Val Thr Arg Ser Val Thr Ala Thr Leu Ala Ser Ala Leu Ala Pro Ala 
980 985 990 

Pro Phe Ala Phe Phe Pro Ser Phe Leu Ala Thr Phe Ala Gly Phe Pro 
995 1000 1005 

Arg Gin Ala Leu Asn Arg Gly Leu Pro Leu Gly Phe Arg Phe Arg 
1010 1015 1020 

Ala Leu Arg His Leu Asp Arg Lys Lys Leu Asp Leu Gly Asp Gly 
1025 1030 1035 

Ser Arg Ser Gly^ Pro Ser Pro 
1040 " 1045 



<210> 


483 


<211> 


36 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


483 



cgggacctcg aggcgcgtga accccaggag gtccac 36 



<210> 


484 


<211> 


2517 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


484 



atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 
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caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 
aaggagctgg tggacctcct ggggttcacg cgcctcgagg tcccgggcta cgaggcggac 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 
gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 
gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 
ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 
aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 
aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 
aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 
agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 
cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 
gatcttctgg ccctggccgc cgccaggggc ggccgcgtgc accgggcagc agaccccttg 1020 
gcggggctaa aggacctcaa ggaggtccgg ggcctcctcg ccaaggacct cgccgtcttg 1080 
gcctcgaggg aggggctaga cctcgtgccc ggggacgacc ccatgctcct cgcctacctc 1140 
ctggaccctt cgaacaccac ccccgagggg gtggcgcggc gctacggggg ggagtggacg 1200 
gaggacgccg cccaccgggc cctcctctcg gagaggctcc atcggaacct ccttaagcgc 1260 
ctcgaggggg aggagaagct cctttggctc taccacgagg tggaaaagcc cctctcccgg 1320 
gtcctggccc atatggaggc caccggggta cggcgggacg tggcctacct tcaggccctt 1380 
tccctggagc ttgcggagga gatccgccgc ctcgaggagg aggtcttccg cttggcgggc 1440 
caccccttca acctcaactc ccgggaccag ctggaaaggg tgctctttga cgagcttagg 1500 
cttcccgcct tgaagaagac gaagaagaca ggcaagcgct ccaccagcgc cgcggtgctg 1560 
gaggccctac gggaggccca ccccatcgtg gagaagatcc tccagcaccg ggagctcacc 1620 
aagctcaaga acacctacgt ggaccccctc ccaagcctcg tccacccgag gacgggccgc 1680 
ctccacaccc gcttcaaoca gacggccacg gccacgggga ggcttagtag ctccgacccc 1740 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
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aacctgcaga 


acatccccgt 


ccgcaccccc 


ttgggccaga 


ggatccgccg 


ggccttcgtg 


1800 


gccgaggcgg 


gttgggcgtt 


ggtggccctg 


gactatagcc 


agatagagct 


ccgcgtcctc 


1860 


gcccacctct 


ccggggacga 


aaacctgatc 


agggtcttcc 


aggaggggaa 


ggacatccac 


1920 


acccagaccg 


caagctggat 


gttcggcgtc 


cccccggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agacggtgaa 


cttcggcgtc 


ctctacggca 


tgtccgccca 


taggctctcc 


2040 


caggagcttg 


ccatccccta 


cgaggaggcg 


gtggccttta 


tagagcgcta 


cttccaaagc 


2100 


ttccccaagg 


tgcgggcctg 


gatagaaaag 


accctggagg 


aggggaggaa 


gcggggctac 


2160 


gtggaaaccc 


tcttcggaag 


aaggcgctac 


gtgcccgacc 


tcaacgcccg 


ggtgaagagc 


2220 




c cacactAaca 

^'jj y w jj 


catacfccttc 




heGacrcfOGar* 

l« ^ Vm CiVj yy WCi^^ 




2280 


ctcatgaagc 


tcgccatggt 


gaagctcttc 


ccccgcctcc 


gggagatggg 


ggcccgcatg 


2340 


ctcctccagg 


tcgccaacga 


gctcctcctg 


gaggcccccc 


aagcgcgggc 


cgaggaggtg 


2400 


gcggctttgg 


ccaaggaggc 


catggagaag 


gcctatcccc 


tcgccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


tgggggagga 


ctggctttcc 


gccaagggtc 


accaccacca 


ccaccac 


2517 



<210> 485 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 485 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 
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Qlu Val Pro Gly Tyr Olu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lya Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 



130 



135 



140 



Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 



180 



185 



190 



Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 . 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 



225 



230 



235 



240 



Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 ' 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Qlu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met. Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

val Pro Gly Asp Asp Pro Met. Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 



489 



wo 01/90337 



PCT/USOl/17086 



Leu Leu Lys Arg Leu 61u 61y Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys 

500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He His 
625 630 635 640 

Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 

675 680 685 

Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 



Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
705 710 715 720 
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Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 
725 730 735 

Arg val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val 
785 790 795 800 

Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Gly His His His His His His 





835 


<210> 


486 


<211> 


36 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


486 



gtggaccttc tgggctttac ccgcctcgag gccccg 



<210> 


487 


<211> 


36 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


487 



cggggcctcg aggcgggtaa agcccagaag gtccac 



<210> 488 

<211> 842 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
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<400> 488 

Met Asn Ser Thr Pro Leu Phe Asp Leu Glu Glu Pro Pro Lys Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Tyr Ala Leu 
20 25 30 

Ser Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Met Val Tyr Gly Phe 
35 40 45 

Ala Arg Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Gin Ala Val Val 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 
65 70 75 80 

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu Val Lys Arg Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Ala Pro Gly Tyr Glu Ala Asp Asp Val Leu Gly Thr Leu Ala Lys 
115 120 125 

Lys Ala Glu Arg Glu Gly Met Glu Val Arg lie Leu Thr Gly Asp Arg 
130 135 140 

Asp Phe Phe Gin Leu Leu Ser Glu Lys Val Ser Val Leu Leu Pro Asp 
145 150 155 160 

Gly Thr Leu Val Thr Pro Lys Asp Val Gin Glu Lys Tyr Gly Val Pro 
165 170 175 

Pro Glu Arg Trp Val Asp Phe Arg Ala Leu Thr Gly Asp Arg Ser Asp 
180 185 190 

Asn lie Pro Gly Val Ala Gly lie Gly Glu Lys Thr Ala Leu Arg Leu 
195 200 205 

Leu Ala Glu Trp Gly Ser Val Glu Asn Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Val Lys Pro Asp Ser Leu Arg Arg Lys lie Glu Ala His Lqu Glu Asp 

225 230 235 240 

Leu His Leu Ser Leu Asp Leu Ala Arg lie Arg Thr Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Phe Lys Ala Leu Arg Arg Arg Thr Pro Asp Leu Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Glu Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro 
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290 



295 



300 



Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu Ser Arg Lys Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Leu Ala Leu Ala Ala Ala Ser Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
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595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp lie His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 

<210> 489 

<211> 24 

<212> DNA 

<213> Artificial 

<2205> 

<223> Synthetic 
<400> 489 

tttacccgcc tcgaggtgcc gggc 24 



494 



wo 01/90337 



PCT/USOl/17086 



<210> 


490 


<211> 


36 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


490 



cggcacctcg aggcgggtaa agcccaaaag gtccac 



<210> 491 

<211> 838 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 491 

Met Asn Ser Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
1 5 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
20 25 30 

Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val He Val Val 
50 55 60 

Phe ASP Ala Lys Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu He Lys Glu Met Val Asp Leu Leu Gly Phe Thr Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Arg Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Glu Arg He Ser He Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu Lys Pro Ser 
165 170 175 
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Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser Asp Asn lie 
180 185 190 

Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Ala Lys Leu lie Arg 
195 200 205 

Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys His Leu Glu Gin Val Lys 
210 215 220 

Pro Ala Ser Val Arg Glu Lys lie Leu Ser His Met Glu Asp Leu Lys 
225 230 235 240 

Leu Ser Leu Glu Leu Ser Arg Val His Thr Asp Leu Leu Leu Gin Val 
245 250 255 

Asp Phe Ala Arg Arg Arg Glu Pro Asp Arg Glu Gly Leu Lys Ala Phe 
260 265 270 

Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu 
275 280 285 

Glu Ser Pro Val Ala Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 
290 295 300 

Ala Phe Val Gly Tyr Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu 
305 310 315 320 

Leu Asn Ala Leu Ala Ala Ala Trp Gly Gly Arg Val His Arg- Ala Ala 
325 330 335 

Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu Leu 
340 345 350 

Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu Val 
355 360 365 

Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 
370 375 380 

Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu 
385 390 395 400 

Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn Leu 
405 410 415 

Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His Glu 
420 425 430 

Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr Gly 
435 440 445 

Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu Ala 
450 455 460 



Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly His 
465 470 475 480 
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Pro Phe Aan Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp 
485 490 495 

Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys Arg 
500 505 510 

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro lie 
515 520 525 

Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn Thr ■ 
530 535 540 

Tyr val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg Leu 
545 550 555 560 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 570 575 

Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly Gin 
580 585 590 

Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val Ala 
595 ^00 

Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 615 620 

Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He His Thr 
625 630 635 640 

Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp Pro 
645 650 655 

Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr Gly 
660 665 670 

Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu Glu 
675 680 685 

Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg 
690 695 700 

Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr Val 
705 710 715 720 

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala Arg 
725 730 735 

Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 
740 745 750 

Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu 
755 , 760 765 

Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val Ala 
770 775 780 
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Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val Ala 
785 790 795 800 

Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val Pro 
805 810 815 

Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys Gly 
820 825 830 

His His His His His His 
835 



<210> 


492 


<211> 


34 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


492 



atcgtggtct ttgacgccga ggccccctcc ttcc 34 



<210> 493 

<211> 34 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 493 

ggaaggaggg ggcctcggcg tcaaagacca cgat ' 34 



<210> 494 

<211> 842 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 494 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 



498 



wo 01/90337 



PCT/USOl/17086 



Val Phe Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Qlu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu 

195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Qlu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 



Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 
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Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 

515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp lie His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 
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Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 

740 745 750 

Phe Asn Met Pro Val Gin Gly Thr. Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 

805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 

<210> 495 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 495 



atgaattcgg ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccgag 


gccccctcct 


tccgccacga ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggctggcg 


cgcctcgagg 


tcccgggcta cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt. ccgcatcctc 


420 
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accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


gsggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


gcggggctaa 


aggacctcaa 


ggaggtccgg 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


gcctcgaggg 


aggggctaga 


cctcgtgccc 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


gaggacgccg 


cccaccgggc 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


caccggggta 


cggcgggacg 


tggcctacct 


tcaggccctt 


1380 


tccctggagc 


ttgcggagga 


gatccgccgc 


ctcgaggagg 


aggtcttccg 


cttggcgggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tgctctttga 


cgagcttagg 


1500 


cttcccgcct 


tgaagaagac 


gaagaagaca 


ggcaagcgct 


ccaccagcgc 


cgcggtgctg 


1560 


gaggccctac 


g99aggccca 


ccccatcgtg 


gagaagatcc 


tccagcaccg 


ggagctcacc 


1620 


aagctcaaga 


acacctacgt 


ggaccccctc 


ccaagcctcg 


tccacccgag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggga 


ggcttagtag 


ctccgacccc 


1740 


aacctgcaga 


acatccccgt 


ccgcaccccc 


ttgggccaga 


ggatccgccg 


ggccttcgtg 


1800 


gccgaggcgg 


gttgggcgtt 


ggtggccctg 


gactatagcc 


agatagagct 


ccgcgtcctc 


1860 


- 

gcccacctcti 


ccggggacga 


aaacctgatc 


agggtcttcc 


aggaggggaa 


ggacatccac 


1920 


acccagaccg 


caagctggat 


gttcggcgtc 


cccccggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agacggtgaa 


cttcggcgtc 


ctctacggca 


tgtccgccca 


taggctctcc 


2040 


caggagcttg 


ccatccccta 


cgaggaggcg 


gtggccttta 


tagagcgcta 


cttccaaagc 


2100 


ttccccaagg 


tgcgggcctg 


gatagaaaag 


accctggagg 


aggggaggaa 


gcggggctac 


2160 
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gtggaaaccc tcttcggaag aaggcgctac gtgcccgacc tcaacgcccg ggtgaagagc 2220 

gtcagggagg ccgcggagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tcgccatggt gaagctcttc ccccgcctcc gggagatggg ggcccgcatg 2340 

ctcctccagg tcgccaacga gctcctcctg gaggcccccc aagcgcgggc cgaggaggtg 2400 

gcggctttgg ccaaggaggc catggagaag gcctatcccc tcgccgtgcc cctggaggtg 2460 

gaggtgggga tgggggagga ctggctttcc gccaagggtc accaccacca ccaccac 2517 



<210> 496 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 496 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 
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Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lya Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 



Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 
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His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He His 
625 630 635 640 

Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 



Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 
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Ala Asn Qlu Leu Leu Leu Qlu Ala Pro Gin Ala Arg Ala Glu Glu Val 
785 790 795 800 

Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 
820 . 825 830 

Gly His His His His His His 
835 

<210> 497 

<211> 2526 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 497 



atgaattcca 


ccccactttt 


tgacctggag 


gaacccccca 


agcgggtgct 


tctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttctat 


gccctgagcc 


tcaccacctc 


ccggggggag 


120 


ccggtgcaga 


tggtctacgg 


cttcgcccgg 


agcctcctca 


aggccttgaa 


ggaggacgga 


180 


caggcggtgg 


tcgtggtctt 


tgacgccgag 


gccccctcct 


tccgccacga 


ggcctacgag 


240 - 


gcctacaagg 


cgggccgggc 


ccccaccccg 


gaggacttcc 


cccgccagct 


cgccttggtc 


300 


aagcggctgg 


tggaccttct 


gggcctggtc 


cgcctcgagg 


ccccggggta 


cgaggcggac 


360 


gacgtcctgg 


gcaccctggc 


caagaaggcc 


gaaagggagg 


ggatggaggt 


gcgcatcctc 


420 


acgggagacc 


gggacttctt 


ccagctcctc 


tccgagaagg 


tctcggtcct 


cctgccggac 


480 


gggaccctgg 


tcaccccaaa 


ggacgtccag 


gagaagtacg 


gggtgccccc 


ggagcgctgg 


540 


gtggacttcc 


gcgccctcac 


gsrgggaccgc 


tcggacaaca 


tccccggggt 


ggcggggata 


600 


ggggagaaga 


ccgcccttcg 


actcctcgca 


gagtggggga 


gcgtggaaaa 


cctcctgaag 


660 


aacctggacc 


gggtaaagcc 


ggactcgctc 


cggcgcaaga 


tagaggcgca 


cctcgaggac 


720 


ctccacctct 


ccttagacct 


ggcccgcatc 


cgcaccgacc 


tccccctgga 


ggtggacttt 


780 


aaggccctgc 


gccgcaggac 


ccccgacctg 


gagggcctga 


gggccttttt 


ggaggagctg 


840 


gagttcggaa 


gcctcctcca 


cgagttcggc 


ctcctgggag 


gggagaagcc 


ccgggaggag 


900 


gccccctggc 


ccccgcccga 


aggggccttc 


gtgggcttcc 


tcctttcccg 


caaggagccc 


960 


atgtgggcgg 


agcttctggc 


cctggcggcg 


gcctcgggcg 


gccgcgtgca 


ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


1080 
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gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140 

gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 1200 

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 1260 

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 1320 

ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 1380 

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500 

gagcttaggc ttcccgcctt gaagaagacg aagaagacag gcaagcgctc caccagcgcc 1560 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 2520 

caccac 2526 

<210> 498 
<211> 842 
<212> PRT 
<213> Artificial 

<220> 
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<223> Synthetic 
<400> 498 

Met Asn Ser Thr Pro Leu Phe Asp Leu Glu Glu Pro Pro Lys Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Tyr Ala Leu 

20 25 30 

Ser Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Met Val Tyr Gly Phe 
35 40 45 

Ala Arg Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Gin Ala Val Val 
50 55 60 

Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala Tyr Glu 
65 70 75 80 

Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu Val Lys Arg Leu Val Asp Leu Leu Gly Leu Val Arg Leu 
100 105 110 

Glu Ala Pro Gly Tyr Glu Ala Asp Asp Val Leu Gly Thr Leu Ala Lys 
115 120 125 

Lys Ala Glu Arg Glu Gly Met Glu Val Arg lie Leu Thr Gly Asp Arg 

130 135 140 

Asp Phe Phe Gin Leu Leu Ser Glu Lys Val Ser Val Leu Leu Pro Asp 
145 150 155 160 

Gly Thr Leu Val Thr Pro Lys Asp Val Gin Glu Lys Tyr Gly Val Pro 
165 170 175 

Pro Glu Arg Trp Val Asp Phe Arg Ala Leu Thr Gly Asp Arg Ser Asp 
180 185 190 

Asn lie Pro Gly Val Ala Gly lie Gly Glu Lys Thr Ala Leu Arg Leu 
195 200 205 

Leu Ala Glu Trp Gly Ser Val Glu Asn Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Val Lys Pro Asp Ser Leu Arg Arg Lys lie Glu Ala His Leu Glu Asp 
225 230 235 240 

Leu His Leu Ser Leu Asp Leu Ala Arg lie Arg Thr Asp Leu Pro Leu 

245 250 255 

Glu Val Asp Phe Lys Ala Leu Arg Arg Arg Thr Pro Asp Leu Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Glu Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 
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Phe Gly Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu Ser Arg Lys Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Leu Ala Leu Ala Ala Ala Ser Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 ' 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 



Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 
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Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Net Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 

740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg, Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Txp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 

840 





835 


<210> 


499 


<211> 


2514 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 
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atgaattccc 


tgcccctctt 


tgagcccaag 


ggccgggtgc 


ttctggtgga 


cggccaccac 


60 


ctggcctacc 


gtaccttttt 


tgccctgaag 


ggcctcacca 


ccagccgcgg 


ggagccggtc 


120 


caggcggtgt 


acgggtttgc 


caagagcctt 


ttgaaggcgc 


taagggaaga 


cggggatgtg 


180 


gtgatcgtgg 


tctttgacgc 


cgaggccccc 


tccttccgcc 


accagaccta 


cgaggcctac 


240 


aaggcggggc 


gggctcccac 


ccccgaggac 


tttccccggc 


agcttgccct 


tatcaaggag 


300 


atggtggacc 


ttttgggcct 


ggagcgcctc 


gaggtgccgg 


gctttgaagc 


ggatgacgtc 


360 


ctggctaccc 


tggccaagaa 


ggcggaaaag 


gaaggctacg 


aagtgcgcat 


cctcaccgcg 


420 


gaccgggacc 


tttaccagct 


tctttcggag 


cgaatctcca 


tccttcaccc 


ggagggttac 


480 


ctgatcaccc 


cggagtggct 


tfcgggagaag 


tatgggctta 


agccttccca 


gtgggtggac 


540 


taccgggcct 


tggccgggga 


cccttccgac 


aacatccccg 


gcgtgaaggg 


catcggggag 


600 


aagacggcgg 


ccaagctgat 


ccgggagtgg 


ggaagcctgg 


aaaaccttct 


taagcacctg 


660 


gaacaggtga 


aacctgcctc 


cgtgcgggag 


aagatcctta 


gccacatgga 


ggacctcaag 


720 


ctatccctgg 


agctatcccg 


ggtgcacacg 


gacttgctcc 


ttcaggtgga 


cttcgcccgg 


780 


cgccgggagc 


cggaccggga 


ggggcttaag 


gcctttttgg 


agaggctgga 


gttcggaagc 


840 


ctcctccacg 


agttcggcct 


gttggaaagc 


ccggtggcgg 


cggaggaagc 


tccctggccg 


900 


ccccccgagg 


gagccttcgt 


ggggtacgtt 


ctttcccgcc 


ccgagcccat 


gtgggcggag 


960 


cttaacgcct 


tggccgccgc 


ctggggcggc 


cgcgtgcacc 


gggcagcaga 


ccccttggcg 


1020 


gggctaaagg 


acctcaagga 


ggtccggggc 


ctcctcgcca 


aggacctcgc 


cgtcttggcc 


1080 


tcgagggagg 


ggctagacct 


cgtgcccggg 


gacgacccca 


tgctcctcgc 


ctacctcctg 


1140 


gacccttcga 


acaccacccc 


cgagggggtg 


gcgcggcgct 


acggggggga 


gtggacggag 


1200 


gacgccgccc 


accgggccct 


cctctcggag 


aggctccatc 


ggaacctcct 


taagcgcctc 


1260 


gagggggagg 


agaagctcct 


ttggctctac 


cacgaggtgg 


aaaagcccct 


ctcccgggtc 


1320 


ctggcccata 


tggaggccac 


cggggtacgg 


cgggacgtgg 


cctaccttca 


ggccctttcc 


1380 


ctggagcttg 


cggaggagat 


ccgccgcctc 


gaggaggagg 


tcttccgctt 


ggcgggccac 


1440 


cccttcaacc 


tcaactcccg 


ggaccagctg 


gaaagggtgc 


tctttgacga 


gcttaggctt 


1500 


cccgccttga 


agaagacgaa 


gaagacaggc 


aagcgctcca 


ccagcgccgc 


ggtgctggag 


1560 


gccctacggg 


aggcccaccc 


catcgtggag 


aagatcctcc 


agcaccggga 


gctcaccaag 


1620 


ctcaagaaca 


cctacgtgga 


ccccctccca 


agcctcgtcc 


acccgaggac 


gggccgcctc 


1680 
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cacacccgct 


tcaaccagac 


ggccacggcc 


acggggaggc 


ttagtagctc 


cgaccccaac 


1740 


ctgcagaaca 


tccccgtccg 


cacccccttg 


ggccagagga 


tccgccgggc 


cttcgtggcc 


1800 


gaggcgggtt 


gggcgttggt 


ggccctggac 


tatagccaga 


tagagctccg 


cgtcctcgcc 


1860 


cacctctccg 


gggacgaaaa 


cctgatcagg 


gtcttccagg 


aggggaagga 


catccacacc 


1920 


cagaccgcaa 


gctggatgtt 


cggcgtcccc 


ccggaggccg 


tggaccccct 


gatgcgccgg 


1980 


gcggccaaga 


cggtgaactt 


cggcgtcctc 


tacggcatgt 


ccgcccatag 


gctctcccag 


2040 


gagcttgcca 


tcccctacga 


ggaggcggtg 


gcctttatag 


agcgctactt 


ccaaagcttc 


2100 


cccaaggtgc 


gggcctggat 


agaaaagacc 


ctggaggagg 


ggaggaagcg 


gggctacgtg 


2160 


gaaaccctct 


tcggaagaag 


gcgctacgtg 


cccgacctca 


acgcccgggt 


gaagagcgtc 


2220 




y ciy i^y L» ci u 


y y u I. waavJ 


auycccy ucc 


agyyCaccgc 


cgccgacccc 




atgaagctcg 


ccatggtgaa 


gctcttcccc 


cgcctccggg 


agatgggggc 


ccgcatgctc 


2340 


ctccaggtcg 


ccaacgagct 


cctcctggag 


gccccccaag 


cgcgggccga 


ggaggtggcg 


2400 


gctttggcca 


aggaggccat 


ggagaaggcc 


tatcccctcg 


ccgtgcccct 


ggaggtggag 


2460 


gtggggatgg 


gggaggactg 


gctttccgcc 


aagggtcacc 


accaccacca 


ccac 


2514 



<210> 500 

<211> 838 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 500 

Met Asn Ser Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
1 5 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe' Phe Ala Leu Lys Gly Leu 
20 25 30 

Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val lie Val Val 
50 55 60 

Phe Asp Ala Glu Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 
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Leu He Lys Glu Met Val Asp Leu Leu Gly Leu Glu Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Olu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Arg Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Glu Arg He Ser He Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu Lys Pro Ser 
165 170 175 

Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser Asp Asn He 
180 185 190 

Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Ala Lys Leu He Arg 
195 200 205 

Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys His Leu Glu Gin Val Lys 
210 215 220 

Pro Ala Ser Val Arg Glu Lys He Leu Ser His Met Glu Asp Leu Lys 
225 230 235 240 

Leu Ser Leu Glu Leu Ser Arg Val His Thr Asp Leu Leu Leu Gin Val 
245 250 255 

Asp Phe Ala Arg Arg Arg Glu Pro Asp Arg Glu Gly Leu Lys Ala Phe 
260 265 270 

Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu 
275 280 285 

Glu Ser Pro Val Ala Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly 
290 295 300 

Ala Phe Val Gly Tyr Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu 
305 310 315 320 

Leu Asn Ala Leu Ala Ala Ala Trp Gly Gly Arg Val His Arg Ala Ala 
325 330 335 

Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu Leu 
340 345 350 

Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu Val 
355 360 365 

Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn 
370 375 380 



Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu 
385 390 395 400 
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Asp Ala Ala His Arg Ala Leu Leu Ser Qlu Arg Leu His Arg Asn Leu 
405 410 415 

Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His Glu 
420 425 430 

Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr Gly 
435 440 445 

Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu Ala 
450 455 460 

Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly His 
465 470 475 480 

Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp 
485 490 495 

Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys Arg 
500 505 510 

Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro lie 
515 520 525 

Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn Thr 
530. 535 540 

Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg Leu 
545 550 555 560 

His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser 
565 570 575 

Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin 
580 585 590 

Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val Ala 
595 600 605 

Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly 
610 615 620 

Asp Glu Asn Leu He Arg val Phe Gin Glu Gly Lys Asp He His Thr 

625 630 635 640 

Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp Pro 
645 650 655 

Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr Gly 
660 665 670 

Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu Glu 
675 680 685 



Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg 
690 695 700 



514 



wo 01/90337 



PCTAJSO 1/17086 



Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr Val 
705 710 715 720 

Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala Arg 
725 730 735 

Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro 
740 745 750 

Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu 
755 760 765 

Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu .Gin Val Ala 
770 775 780 

Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val Ala 
785 790 795 800 

Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val Pro 
805 810 815 

Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys Gly 
820 825 830 

His His His His His His 
835 

<210> 501 

<211> 2517 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 



<400> 501 
atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccgag 


gccccctcct 


tccgccacga ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggttcacg 


cgcctcgagg 


tcccgggcta cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc cctcctcaag 


660 
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aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


gcggggctaa 


aggacctcaa 


ggaggtccgg 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


gcc tcgaggg 


agsrSfSfCtaga 


cctcgtgccc 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


gaggacgccg 


cccaccgggc 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


caccggggta 


cggcgggacg 


tggcctacct 


tcaggccctt 


1380 


tccctggagc 


ttgcggagga 


gatccgccgc 


ctcgaggagg 


aggtcttccg 


cttggcgggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tgctctttga 


cgagcttagg 


1500 


cttcccgcct 


tgaagaagac 


gaagaagaca 


ggcaagcgct 


ccaccagcgc 


cgcggtgctg 


1560 


gaggccctac 


gggaggccca 


ccccatcgtg 


gagaagatcc 


tccagcaccg 


ggagctcacc 


1620 


aagctcaaga 


acacctacgt 


ggaccccctc 


ccaagcctcg 


tccacccgag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggga 


ggcttagtag 


ctccgacccc 


1740 


aacctgcaga 


acatccccgt 


ccgcaccccc 


ttgggccaga 


ggatccgccg 


ggccttcgtg 


1800 


gccgaggcgg 


gttgggcgtt 


ggtggccctg 


gactatagcc 


agatagagct 


ccgcgtcctc 


1860 


gcccacctct 


ccggggacga 


aaacctgatc 


agggtcttcc 


aggaggggaa 


ggacatccac 


1920 


acccagaccg 


caagctggat 


gttcggcgtc 


cccccggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agacggtgaa 


cttcggcgtc 


ctctacggca 


tgtccgccca 


taggctctcc 


2040 


caggagcttg 


ccatccccta 


cgaggaggcg 


gtggccttta 


tagagcgcta 


cttccaaagc 


2100 


ttccccaagg 


tgcgggcctg 


gatagaaaag 


accctggagg 


aggggaggaa 


gcggggctac 


2160 


gtggaaaccc 


tcttcggaag 


aaggcgctac 


gtgcccgacc 


tcaacgcccg 


ggtgaagagc 


2220 


gtcagggagg 


ccgcggagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tcgccatggt 


gaagctcttc 


ccccgcctcc 


gggagatggg 


ggcccgcatg 


2340 
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ctcctccagg tcgccaacga gctcctcctg gaggcccccc aagcgcgggc cgaggaggtg 2400 
gcggctttgg ccaaggaggc catggagaag gcctatcccc tcgccgtgcc cctggaggtg 2460 
gaggtgggga tgggggagga ctggctttcc gccaagggtc accaccacca ccaccac 2517 



<210> 


502 


<211> 


839 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


502 


Met Asn Ser Gly Met 


1 


5 



10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 

35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 . 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
.115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
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210 



215 



220 



Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 

290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg hrg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 ; 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 



Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
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515 



520 



525 



He Val Glu hya lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 



Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He His 
625 630 635 640 

Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 

690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val 
785 790 795 800 

Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 



565 



570 



575 



Ser Ser Asp Pro Asn Leu 
580 



Gin Asn He Pro Val Arg Thr Pro Leu Gly 
585 590 
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820 825 830 

Gly His His His His His His 
835 



<210> 


503 


<211> 


34 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 
1 


Synthetic 


<400> 


503 



atcgtggtct ttgacgccga ggccccctcc ttcc 34 



<210> 504 

<211> 34 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 504 

ggaaggaggg ggcctcggcg tcaaagacca cgat 34 



<210> 505 

<211> 2520 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 505 



atgaattcgg 


aggcgatgct gcccctcttt 


gagcccaagg 


gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg caccttccac 


gccctgaagg 


gcctcaccac 


cagccggggg 


120 


gagccggtgc 


aggcggtcta cggcttcgcc 


aagagcctcc 


tcaaggccct 


caaggaggac 


180 


ggggacgcgg 


tgatcgtggt ctttgacgcc gaggccccct 


ccttccgcca 


cgaggcctac 


240 


ggggggtaca 


aggcgggccg ggcccccacg 


ccggaggact 


ttccccggca 


actcgccctc 


300 


atcaaggagc 


tggtggacct cctggggttc 


acgcgcctcg 


aggtcccggg 


ctacgaggcg 


360 


gacgacgtcc 


tggccagcct ggccaagaag 


gcggaaaagg 


agggctacga 


ggtccgcatc 


420 


ctcaccgccg 


acaaagacct ttaccagctc 


ctttccgacc 


gcatccacgt 


cctccacccc 


480 


gaggggtacc 


tcatcacccc ggcctggctt 


tgggaaaagt 


acggcctgag gcccgaccag 


540 


tgggccgact 


accgggccct gaccggggac gagtccgaca 


accttcccgg ggtcaagggc 


600 
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atcggggaga agacggcgag gaagcttctg gaggagtggg ggagcctgga agccctcctc 660 

aagaacctgg accggctgaa gcccgccatc cgggagaaga tcctggccca catggacgat 720 

ctgaagctct cctgggacct ggccaaggtg cgcaccgacc tgcccctgga ggtggacttc 780 

gccaaaaggc gggagcccga ccgggagagg cttagggcct ttctggagag gcttgagttt 840 

ggcagcctcc tccacgagtt cggccttctg gaaagcccca aggccctgga ggaggccccc 900 

tggcccccgc cggaaggggc cttcgtgggc tttgtgcttt cccgcaagga gcccatgtgg 960 

gccgatcttc tggccctggc cgccgccagg ggcggccgcg tgcaccgggc agcagacccc 1020 

ttggcggggc taaaggacct caaggaggtc cggggcctcc tcgccaagga cctcgccgtc 1080 

ttggcctcga gggaggggct agacctcgtg cccggggacg accccatgct cctcgcctac 1140 

ctcctggacc cttcgaacac cacccccgag ggggtggcgc ggcgctacgg gggggagtgg 1200 

acggaggacg ccgcccaccg ggccctcctc tcggagaggc tccatcggaa cctccttaag 1260 

cgcctcgagg gggaggagaa gctcctttgg ctctaccacg aggtggaaaa gcccctctcc 1320 

cgggtcctgg cccatatgga ggccaccggg gtacggcggg acgtggccta ccttcaggcc 1380 

ctttccctgg agcttgcgga ggagatccgc cgcctcgagg aggaggtctt ccgcttggcg 1440 

. ggccacccct tcaacctcaa ctcccgggac cagctggaaa gggtgctctt tgacgagctt 1500 

aggcttcccg ccttgaagaa gacgaagaag acaggcaagc gctccaccag cgccgcggtg 1560 

ctggaggccc tacgggaggc ccaccccatc gtggagaaga tcctccagca ccgggagctc 1620 

accaagctca agaacaccta cgtggacccc ctcccaagcc tcgtccaccc gaggacgggc 1680 

cgcctccaca cccgcttcaa ccagacggcc acggccacgg ggaggcttag tagctccgac 1740 

cccaacctgc agaacatccc cgtccgcacc cccttgggcc agaggatccg ccgggccttc 1800 

gtggccgagg cgggttgggc gttggtggcc ctggactata gccagataga gctccgcgtc 1860 

ctcgcccacc tctccgggga cgaaaacctg atcagggtct tccaggaggg gaaggacatc 1920 

cacacccaga ccgcaagctg gatgttcggc gtccccccgg aggccgtgga ccccctgatg 1980 

cgccgggcgg ccaagacggt gaacttcggc gtcctctacg gcatgtccgc ccataggctc 2040 

tcccaggagc ttgccatccc ctacgaggag gcggtggcct ttatagagcg ctacttccaa 2100 

agcttcccca aggtgcgggc ctggatagaa aagaccctgg aggaggggag gaagcggggc 2160 

tacgtggaaa ccctcttcgg aagaaggcgc tacgtgcccg acctcaacgc ccgggtgaag 2220 

agcgtcaggg aggccgcgga gcgcatggcc ttcaacatgc ccgtccaggg caccgccgcc 2280 
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gacctcatga agctcgccat ggtgaagctc ttcccccgcc tccgggagat gggggcccgc 2340 

atgctcctcc aggtcgccaa cgagctcctc ctggaggccc cccaagcgcg ggccgaggag 2400 

gtggcggctt tggccaagga ggccatggag aaggcctatc ccctcgccgt gcccctggag 2460 

gtggaggtgg ggatggggga ggactggctt tccgccaagg gtcaccacca ccaccaccac 2520 



<210> 506 

<211> 840 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 506 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val 

50 -55 60 

lie Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala Tyr 
65 70 75 80 

Gly Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg 
85 90 95 

Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg 
100 105 110 

Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala 
115 120 125 

Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp 
130 135 140 

Lys Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro 
145 150 155 160 

Glu Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu 
165 170 175 

Arg Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser 
180 185 190 

Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys 
195 200 205 



522 



wo 01/90337 



PCT/USO 1/17086 



Leu Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp 
210 215 220 

Arg Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp 
225 230 235 240 

Leu Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg 
260 265 270 

Ala Phe Leu Glu Arg Leu Glu phe Gly Ser Leu Leu His Glu Phe Gly 
275 280 285 

Leu Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro 
290 295 300 

Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp 
305 310 315 320 

Ala Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg 
325 330 335 

Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly 
340 345 350 

Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp 
355 360 365 

Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro 
370 375 380 

Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp 
385 390 395 400 

Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg 
405 410 415 

Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr 
420 425 430 

His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala 
435 440 445 

Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu 
450 455 460 

Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala 
465 470 475 480 

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 
485 490 . 495 

Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly 
500 505 510 
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Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His 
515 520 525 

Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys 
530 535 540 

Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly 
545 550 555 560 

Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu 
565 570 575 

Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu 
580 585 590 

Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu 
595 600 605 

Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu 
610 615 620 

Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He 
625 630 635 640 

His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val 
645 650 655 

Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu 

660 665 670 

Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr 
675 680 685 

Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys 
690 695 700 

Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly 
705 710 715 720 

Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn 
725 730 735 

Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
740 745 750 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val 
755 760 765 

Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin 

770 775 780 

Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu 
785 790 795 800 



Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala 
805 810 815 
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Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala 
820 825 830 

Lys Gly His His His His His His 

840 





835 


<210> 


507 


<211> 


36 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


507 



cagaccatga attcggaggc gatgctgccc ctcttt 



<210> 508 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 508 

aaagaggggc agcatcgcct ccgaattcat ggtctg 



<210> 509 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 509 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 
caccacctgg cctaccgcae cttctttgcc ctgaagggcc tcaccaccag ccggggggag 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 
gacgcggtga tcgtggtctt tgacgccgag gccccctcct tccgccacga ggcctacggg 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 
aaggagctgg tggacctcct ggggttcacg cgcctcgagg tcccgggcta cgaggcggac 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 
gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 



36 



36 



60 
120 
180 
240 
300 
360 
420 
480 
540 
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gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


gcggggctaa 


aggacctcaa 


ggaggtccgg 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


gcctcgaggg 


aggggctaga 


cctcgtgccc 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


gaggacgccg 


cccaccgggc 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


caccggggta 


cggcgggacg 


tggcctacct 


tcaggccctt 


1380 


tccctggagc 


ttgcggagga 


gatccgccgc 


ctcgaggagg 


aggtcttccg 


cttggcgggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tgctctttga 


cgagcttagg 


1500 


cttcccgcct 


tgaagaagac 


gaagaagaca 


ggcaagcgct 


ccaccagcgc 


cgcggtgctg 


1560 


gaggccctac 


gggaggccca 


ccccatcgtg 


gagaagatcc 


tccagcaccg 


ggagctcacc 


1620 


aagctcaaga 


acacctacgt 


ggaccccctc 


ccaagcctcg 


tccacccgag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggga 


ggcttagtag 


ctccgacccc 


1740 


aacctgcaga 


acatccccgt 


ccgcaccccc 


ttgggccaga 


ggatccgccg 


ggccttcgtg 


1800 


gccgaggcgg 


gttgggcgtt 


ggtggccctg 


gactatagcc 


agatagagct 


ccgcgtcctc 


1860 


gcccacctct 


ccggggacga 


aaacctgatc 


agggtcttcc 


aggaggggaa 


ggacatccac 


1920 


acccagaccg 


caagctggat 


gttcggcgtc 


cccccggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agacggtgaa 


cttcggcgtc 


ctctacggca 


tgtccgccca 


taggctctcc 


2040 


caggagcttg 


ccatccccta 


cgaggaggcg 


gtggccttta 


tagagcgcta 


cttccaaagc 


2100 


ttccccaagg 


tgcgggcctg 


gatagaaaag 


accctggagg 


aggggaggaa 


gcggggctac 


2160 


gtggaaaccc 


tcttcggaag 


aaggcgctac 


gtgcccgacc 


tcaacgcccg 


ggtgaagagc 


2220 
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gtcagggagg ccgcggagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tcgccatggt gaagctcttc ccccgcctcc gggagatggg ggcccgcatg 2340 

ctcctccagg tcgccaacga gctcctcctg gaggcccccc aagcgcgggc cgaggaggtg 2400 

gcggctttgg ccaaggaggc catggagaag gcctatcccc tcgccgtgcc cctggaggtg 2460 

gaggtgggga tgggggagga ctggctttcc gccaagggtc accaccacca ccaccac 2517 



<210> 510 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 510 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lye Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 , 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 
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Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 

420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 



His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 
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Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Qln He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He His 
625 630 635 640 

Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 ' 765 

Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Leu Leu Glu Ala Pro Qln Ala Arg Ala Glu Glu Val 
785 790 795 600 
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Ala Ala Leu 



Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 
805 810 815 



Pro Leu Glu 



Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 



Gly His His His His His His 
835 



<210> 


511 


<211> 


33 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


511 



gcctaccgca ccttctttgc cctgaagggc etc 33 



<210> 512 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 512 

gaggcccttc agggcaaaga aggtgcggta ggc 33 



<210> 513 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 513 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 

.ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcag agaggacggg 180 

gacgcggtga tcgtggtctt tgacgccgag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 

aaggagctgg tggacctcct ggggttcacg cgcctcgagg tcccgggcta cgaggcggac 3 60 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 
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600 
660 

720 



gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 
gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 
ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 
aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 

agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 

gatcttctgg ccctggccgc cgccaggggc ggccgcgtgc accgggcagc agaccccttg 1020 

gcggggctaa aggacctcaa ggaggtccgg ggcctcctcg ccaaggacct cgccgtcttg 1080 

gcctcgaggg aggggctaga cctcgtgccc ggggacgacc ccatgctcct cgcctacctc 1140 

ctggaccctt cgaacaccac ccccgagggg gtggcgcggc gctacggggg ggagtggacg 1200 

gaggacgccg cccaccgggc cctcctctcg gagaggctcc atcggaacct ccttaagcgc 1260 

ctcgaggggg aggagaagct cctttggctc taccacgagg tggaaaagcc cctctcccgg 1320 

gtcctggccc atatggaggc caccggggta cggcgggacg tggcctacct tcaggccctt 1380 

tccctggagc ttgcggagga gatccgccgc ctcgaggagg aggtcttccg cttggcgggc 1440 

caccccttca acctcaactc ccgggaccag ctggaaaggg tgctctttga cgagcttagg 1500 

cttcccgcct tgaagaagac gaagaagaca ggcaagcgct ccaccagcgc cgcggtgctg 1560 

gaggccctac gggaggccca ccccatcgtg gagaagatcc tccagcaccg ggagctcacc 1620 

aagctcaaga acacctacgt ggaccccctc ccaagcctcg tccacccgag gacgggccgc 1680 

ctccacaccc gcttcaacca gacggccacg gccacgggga ggcttagtag ctccgacccc 1740 

aacctgcaga acatccccgt ccgcaccccc ttgggccaga ggatccgccg ggccttcgtg 1800 

gccgaggcgg gttgggcgtt ggtggccctg gactatagcc agatagagct ccgcgtcctc 1860 

gcccacctct ccggggacga aaacctgatc agggtcttcc aggaggggaa ggacatccac 1920 

acccagaccg caagctggat gttcggcgtc cccccggagg ccgtggaccc cctgatgcgc 1980 

cgggcggcca agacggtgaa cttcggcgtc ctctacggca tgtccgccca taggctctcc 2040 

caggagcttg ccatccccta cgaggaggcg gtggccttta tagagcgcta cttccaaagc 2100 

ttccccaagg tgcgggcctg gatagaaaag accctggagg aggggaggaa gcggggctac 2160 
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gtggaaaccc tcttcggaag aaggcgctac gtgcccgacc tcaacgcccg ggtgaagagc 2220 

gtcagggagg ccgcggagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tcgccatggt gaagctcttc ccccgcctcc gggagatggg ggcccgcatg 2340 

ctcctccagg tcgccaacga gctcctcctg gaggcccccc aagcgcgggc cgaggaggtg 2400 

gcggctttgg ccaaggaggc catggagaag gcctatcccc tcgccgtgcc cctggaggtg 2450 

gaggtgggga tgggggagga ctggctttcc gccaagggtc accaccacca ccaccac 2517 



<210> 514 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 514 

Met Asn Ser Qly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 - 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
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180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
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485 



490 



495 



Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lya Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 

610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He His 
625 630 635 640 

Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 7^0 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 



Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val 
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785 790 795 BOO 

Ala Ala Leu Ala Lys Glu Ala Met Qlu Lys Ala Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Qlu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Gly His His His His His His 





835 


<210> 


515 


<211> 


33 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


515 



ctcctcaagg ccctcagaga ggacggggac gcg 



33 



<210> 516 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 5X6 33 
cgcgtccccg tcctctctga gggccttgag gag 

<210> 517 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 
caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 
gacgcggtga tcgtggtctt tgacgccgag gccccctcct tccgccacga ggcctacggg 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 
aaggagctgg tggacctcct ggggttcacg cgcctcgagg tcccgggcta cgaggcggac 
gacgtcctgg ccaccctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 



60 
120 
180 
240 
300 
360 
420 
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accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


ecggggctaa 


aggacctcaa 


ggaggtccgg 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


gcctcgaggg 


aggggctaga 


cctcgtgccc 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


gaggacgccg 


cccaccgggc 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc 


• taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


caccggggta 


cggcgggacg 


tggcctacct 


tcaggccctt 


1380 


tccctggagc 


ttgcggagga 


gatccgccgc 


ctcgaggagg 


aggtcttccg 


cttggcgggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tgctctttga 


cgagcttagg 


1500 


cttcccgcct 


tgaagaagac 


gaagaagaca 


ggcaagcgct 


ccaccagcgc 


cgcggtgctg 


1560 


gaggccctac 


gggaggccca 


ccccatcgtg 


gagaagatcc 


tccagcaccg 


ggagctcacc 


1620 


aagctcaaga 


acacctacgt 


ggaccccctc 


ccaagcctcg 


tccacccgag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggga 


ggcttagtag 


ctccgacccc 


1740 


aacctgcaga 


acatccccgt 


ccgcaccccc 


ttgggccaga 


ggatccgccg 


ggccttcgtg 


1800 


gccgaggcgg 


gttgggcgtt 


ggtggccctg 


gactatagcc 


agatagagct 


ccgcgtcctc 


1860 


gcccacctct 


ccggggacga 


aaacctgatc 


agggtcttcc 


aggaggggaa 


ggacatccac 


1920 


acccagaccg 


caagctggat 


gttcggcgtc 


cccccggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agacggtgaa 


cttcggcgtc 


ctctacggca 


tgtccgccca 


taggctctcc 


2040 


caggagcttg 


ccatccccta 


cgaggaggcg 


gtggccttta 


tagagcgcta 


cttccaaagc 


2100 
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<210> 518 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 518 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Qly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 SO 

Glv Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 150 155 160 

Glv' Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 
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ttccccaagg tgcgggcctg gatagaaaag accctggagg aggggaggaa gcggggctac 2160 

gtggaaaccc tcttcggaag aaggcgctac gtgcccgacc tcaacgcccg ggtgaagagc 2220 

gtcagggagg ccgcggagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tcgccatggt gaagctcttc ccccgcctcc gggagatggg ggcccgcatg 2340 

ctcctccagg tcgccaacga gctcctcctg gaggcccccc aagcgcgggc cgaggaggtg 2400 

gcggctttgg ccaaggaggc catggagaag gcctatcccc tcgccgtgcc cctggaggtg 2460 
gaggtgggga tgggggagga ctggctttcc gccaagggtc accaccacca ccaccac 



2517 
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Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 



Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 

325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 

435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 



Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 
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His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

ASP Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 



545 



550 



555 



Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 



Gly ASP Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He His 
625 630 635 640 



Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 » 

Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 

715 720 



705 



710 



Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 
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Ala Asn Glu 
785 



Leu 



Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val 
790 795 800 



Ala Ala Leu 



Ala 



Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 
805 810 815 



Pro Leu Glu 



Val 
820 



Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 
825 830 



Gly His His His His His His 
835 



<210> 


519 


<211> 


33 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


519 



gacgacgtcc tggccaccct ggccaagaag gcg 33 



<210> 520 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 520 

cgccttcttg gccagggtgg ccaggacgtc gtc 33 



<210> 521 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 521 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccgag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 

aaggagctgg tggacctcct ggggttcacg cgcctcgagg tcccgggcta cgaggcggac 360 



540 
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gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 
gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 
gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 
ggggagaaga cggcgctcaa gcttctggag gagtggggga gcctggaagc cctcctcaag 
aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 
aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 
aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 
agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 
cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 
gatcttctgg ccctggccgc cgccaggggc ggccgcgtgc accgggcagc agaccccttg 
gcggggctaa aggacctcaa ggaggtccgg ggcctcctcg ccaaggacct cgccgtcttg 
gcctcgaggg aggggctaga cctcgtgccc ggggacgacc ccatgctcct cgcctacctc 
ctggaccctt cgaacaccac ccccgagggg gtggcgcggc gctacggggg ggagtggacg 
gaggacgccg cccaccgggc cctcctctcg gagaggctcc atcggaacct ccttaagcgc 
ctcgaggggg aggagaagct cctttggctc taccacgagg tggaaaagcc cctctcccgg 
gtcctggccc atatggaggc caccggggta cggcgggacg tggcctacct tcaggccctt 
tccctggagc ttgcggagga gatccgccgc ctcgaggagg aggtcttccg cttggcgggc 
caccccttca acctcaactc ccgggaccag ctggaaaggg tgctctttga cgagcttagg 
cttcccgcct tgaagaagac gaagaagaca ggcaagcgct ccaccagcgc cgcggtgctg 
gaggccctac gggaggccca ccccatcgtg gagaagatcc tccagcaccg ggagctcacc 
aagctcaaga acacctacgt ggacccbctc ccaagcctcg tccacccgag gacgggccgc 
ctccacaccc gcttcaacca gacggccacg gccacgggga ggcttagtag ctccgacccc 
aacctgcaga acatccccgt ccgcaccccc ttgggccaga ggatccgccg ggccttcgtg 
gccgaggcgg gttgggcgtt ggtggccctg gactatagcc agatagagct ccgcgtcctc 
gcccacctct ccggggacga aaacctgatc agggtcttcc aggaggggaa ggacatccac 
acccagaccg caagctggat gttcggcgtc cccccggagg ccgtggaccc cctgatgcgc 
cgggcggcca agacggtgaa cttcggcgtc ctctacggca tgtccgccca taggctctcc 



420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
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caggagcttg 


ccatccccta 


cgaggaggcg gtggccttta 


tagagcgcta ctt'ccaaagc 


2100 


ttccccaagg 


tgcgggcctg 


gatagaaaag accctggagg 


aggggaggaa gcggggctac 


2160 


gtggaaaccc 


tcttcggaag 


aaggcgctac gtgcccgacc 


tcaacgcccg ggtgaagagc 


2220 


ate aaaaaacf 


ccaccrcraQCcr 


catggccttc aacatgcccg 


bccaacrcrcac caccocccrac 


2280 


ctcatgaagc 


tcgccatggt 


gaagctcttc ccccgcctcc 


gggagatggg ggcccgcatg 


2340 


ctcctccagg 


tcgccaacga 


gctcctcctg gaggcccccc 


aagcgcgggc cgaggaggtg 


2400 


gcggctttgg 


ccaaggaggc 


catggagaag gcctatcccc 


tcgccgtgcc cctggaggtg 


2460 


gaggtgggga 


tgggggagga 


ctggctttcc gccaagggtc 


accaccacca ccaccac 


2517 



<210> 522 

<211> .839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 522 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 . 150 155 160 
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Qly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 



165 



170 



Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Qlu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 



225 



230 



235 



Lys 



Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 



245 



250 



255 



Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 

265 270 



260 



Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 

280 285 



275 

Leu Glu ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 



325 



330 



Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Qly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 



405 



410 



415 



Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 
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Ala Glu Glu lie Arg Arg Leu Qlu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
465 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 . 525 

lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
■ 580 585 590 

Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys Asp He His 
625 630 635 640 

Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 



Pro" Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 
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Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val 
785 790 795 800 

Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 
805 

Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 ' 830 

Gly His His His His His His 
835 

<210> 523 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 523 33 
ggggagaaga cggcgctcaa gcttctggag gag 



<210> 524 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



<400> 524 

ctcctccaga agcttgagcg ccgtcttctc ccc 



<210> 525 

<211> 2517 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 

<400> 525 . 
atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 

gacgcggtga tcgtggtctt tgacgccgag gccccctcct tccgccacga ggcctacggg 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 



33 



60 
120 
180 
240 
300 
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aaggagctgg 


tggacctcct 


ggggttcacg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggaggggctt 


aaggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


gcggggctaa 


aggacctcaa 


ggaggtccgg 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


gcctcgaggg 


aggggctaga 


cctcgtgccc 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


gaggacgccg 


cccaccgggc 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


caccggggta 


cggcgggacg 


tggcctacct 


tcaggccctt 


1380 


tccctggagc 


ttgcggagga 


gatccgccgc 


ctcgaggagg 


aggtcttccg 


cttggcgggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tgctctttga 


cgagcttagg 


1500 


cttcccgcct 


tgaagaagac 


gaagaagaca 


ggcaagcgct 


ccaccagcgc 


cgcggtgctg 


1560 


gaggccctac 


gggaggccca 


ccccatcgtg 


gagaagatcc 


tccagcaccg 


ggagctcacc 


1620 


aagctcaaga 


acacctacgt 


ggaccccctc 


ccaagcctcg 


tccacccgag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggga 


ggcttagtag 


ctccgacccc 


1740 


aacctgcaga 


acatccccgt 


ccgcaccccc 


ttgggccaga 


ggatccgccg 


ggccttcgtg 


1800 


gccgaggcgg 


gttgggcgtt 


ggtggccctg 


gactatagcc 


agatagagct 


ccgcgtcctc 


1860 


gcccacctct 


ccggggacga 


aaacctgatc 


agggtcttcc 


aggaggggaa 


ggacatccac 


1920 


acccagaccg 


caagctggat 


gttcggcgtc 


cccccggagg 


ccgtggaccc 


cctgatgcgc 


1980 
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cgggcggcca agacggtgaa cttcggcgtc ctctacggca tgtccgccca taggctctcc 2040 

caggagcttg ccatccccta cgaggaggcg gtggccttta tagagcgcta cttccaaagc 2100 

ttccccaagg tgcgggcctg gatagaaaag accctggagg aggggaggaa gcggggctac 2160 

gtggaaaccc tcttcggaag aaggcgctac gtgcccgacc tcaacgcccg ggtgaagagc 2220 

gtcagggagg ccgcggagcg catggccttc aacatgcccg tccagggcac cgccgccgac 2280 

ctcatgaagc tcgccatggt gaagctcttc ccccgcctcc gggagatggg ggcccgcatg 2340 

ctcctccagg tcgccaacga gctcctcctg gaggcccccc aagcgcgggc cgaggaggtg 2400 

gcggctttgg ccaaggaggc catggagaag gcctatcccc tcgccgtgcc cctggaggtg 2460 

gaggtgggga tgggggagga ctggctttcc gccaagggtc accaccacca ccaccac 2517 



<210> 526 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 526 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
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145 



150 



155 



160 



Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Gly Leu Lys Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 



Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
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450 



455 



460 



Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Qlu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He His 
625 630 635 640 

Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
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755 



760 



765 



lieu Phe Pro Arg Leu 
770 



Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 
775 780 



Ala Asn Glu Leu Leu 
785 



Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val 
790 795 800 



Ala Ala Leu Ala Lys 
805 



Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 
810 815 



Pro Leu Glu Val Glu 
820 



Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 
625 830 



Gly His His His His His His 
835 



<210> 


527 


<211> 


39 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


527 



gagcccgacc gggaggggct taaggccttt ctggagagg 39 



<210> 


528 


<211> 


39 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


528 



cctctccaga aaggccttaa gcccctcccg gtcgggctc 39 



<210> 


529 


<211> 


2517 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


529 



atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cc taccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccgag gccccctcct tccgccacga ggcctacggg 240 
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acra t cLC aaoa 


GQCtaccctaac 
^yyy^^yyy^ 


WWWWMW^ 


aaacxactttc 


cccggcaact 


cgccctcatc 


300 




tocracctcct 


aaaattcaCQ 


CQCCtCQaQQ 

w^ww wwg**35 


tcccgggcta 


cgaqgcgqac 


360 


aacQtcctQQ 


ccaacctcrcrc 


caaaaaQQcq 


qaaaaqQagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aaaaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccoac 


CtQQCtttCICl 

w» w w w wgg 


QaaaacrtacQ 

^ ******** 3 ••**^*3 


qcctcraqcrcc 

^ww w^jw^g w w 


cgaccagtgg 


540 


Qccaaciiacc 


acraccctoac 

*3*3g w WW %v^aw 


cooacraccraQ 
•-yyyy^^y^^y 


tccgacaacc 


ttcccqQQQt 

w wwwwg w 


caagggcatc 


600 




r*cf cir*rf acraa a 


y w i»LiW ^yy ^y 


cr aa t aacf acra 
y Ay L>y yyyy »• 


acctacraacrc 

y w w w^ g w 


cctcctcaag 


660 


a a r« r* t" Ofra r» r« 


oci r« t" cf a a cr f r» 
y y w uy aoy fci ^ 


wy w ^ A i» ».» <^y y 


aarraaaafccc 


taacccac at 


aaacaa t c t q 


720 


a acic t" e t cc fc 


yyy »wv# v»yy v« 


caacrcrtocac 


accgacctgc 


ccctaaaaat 

WWWW*J*JW*J^ W 


ggacttcgcc 


780 


a a a a cr cr c acTcr 

aaaa^y ^^^^ 


ci^ w w w^ Vi>^ 


crcraaaaactt 
y y *y yy 


aaaacctttc 

C4^^^g W W W W w W 


haoaaaoac t 

^^y y Ay ".y y V* ^ 


taaoftttqqc 

**^J www ^ ^ w 


840 


agcctcctcc 


acaaQttccrcr 

»*wy w ww^3 


ccttctggga 


qqqqaQaaQC 


ccCQQcraqqa 

W W W«3^^M^^M 


qqccccctgg 


900 


ccccccrcccrcr 




CQtaQQCttt 

wjj wgg^ w w w w 


gtgctttccc 


Qcaagqagcc 


catgtgggcc 


960 


crate ttctaa 


ccctcrcrccac 


COG c acrcrcrcr c 


aQCCQCQtQC 
ya'-'wywy w^w 


accaaqcaqc 


agaccccttg 


1020 


crcaaaacliaa 

53 w\j^y >j Vi* 


aggacctcaa 


aaaaa t c coa 
yy^yy ^^^33 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


erect caaocra 


acrcfCfcrc t aaa 
^yyyy ^ **^y^ 


ectcatcrcGC 

w W b w^4 ^ t3 ^ w W 


aaaaB.cctB.cc 
y syy ^**y 


Gcatgctcct 


cgcctacctc 


1140 


ft"cfCfa npft* t* 

V* wy^CLV.«\i« V# W 


ncf a a r* a r»<^ a n 

Va>y CIA V« Aw w 


V'^.'W^y Ayyyy 


y i-yy^ywyyv^ 


yv^uAuyyyyy 


yy Ay uyyA^^y 


1200 


y «yy«^y '-'*-y 


i^(^^av«^yyyo 




y '^y ^y y ^ w l» 


a i" rrma a r^r't* 
cii'V'^yyACL^M'w^ 


pnt" t*aacrccrc 


1260 


^ ^cy ^yyyyy 


ay y ay aay t* 


^ ^ u u u y y u L. w 


♦"a r»i^a r'rta^^frr 
UA^.>wA^yAyy 


f* riCTa a a a or* o 
uy y AA AAy ^ w 


wyy 


1320 


y v^yy uww 


a l~ a t* fff la cxar* 
ouauyyeiyy v.* 


CiACwyyyy i-A 


'-yycyyy^^y 


^ flifir* r' i" a o r» t" 
uyy vw»i*Awv* \^ 


^w Ayy ^ 


1380 


L. C C C I. y y eiy C 


1. cy cy y dy y o. 


era i* r^r^rtr^r^df 
y A i> \i>y ii« Vt^y ^ 


u I. uy ay y Ay y 


Ay y u ii> w i» w y 


V- u Uiyy ^yyy w 


1440 


o ja o r* i" f* o a 


a o f r» a a r* f" 


V* v»y y y A V. v.* Ay 


V* uyy A AAyyy 




P Cf a CI c 1 1 acrci 

^»y Ay w> u i« Ay y 


1500 




^ na a CT a a fra 
uy AAyAAy Av,* 


fra a rra a rra r« a 
y AAy AAy AUiA 


y y wAay Vi*y w ii» 


A^w Ay ^y 


wy v*yy uy w i»y 


1560 


y ^y y w w ^ l. ci l> 


yyyAyy ct.v..A 




y Ay AAy AUvrw 


u^^if Ay^ A^wy 


OCT a fir* t" r* a <^ r* 
yy Ay ^ uv* A^v,* 


1620 


aagctcaaga 


acacctacgt 


ggaccccctc 


ccaagcctcg 


tccacccgag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggga 


ggcttagtag 


ctccgacccc 


1740 


aacctgcaga 


acatccccgt 


ccgcaccccc 


ttgggccaga 


ggatccgccg 


ggccttcgtg 


1800 


gccgaggcgg 


gttgggcgtt 


ggtggccctg 


gactatagcc 


agatagagct 


ccgcgtcctc 


1860 


gcccacctct 


ccggggacga 


aaacctgatc 


agggtcttcc 


aggaggggaa 


ggacatccac 


1920 
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acccagaccg 


caagctggat 


gttcggcgtc cccccggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agacggtgaa 


cttcggcgtc ctctacggca 


tgtccgccca 


taggctctcc 


2040 


caggagcttg 


ccatccccta 


cgaggaggcg gtggccttta 


tagagcgcta 


cttccaaagc 


2100 


ttccccaagg 


tgcgggcctg 


gatagaaaag accctggagg 




gcggggctac 


2160 


gtggaaaccc 


tcttcggaag 


aaggcgctac gtgcccgacc 


tcaacgcccg 


ggtgaagagc 


2220 


gtcagggagg 


ccgcggagcg 


Cauggcccuc aacaugcwcg 


U \.> L> L> d V« 


^ ^ ^ ^ Cl U> 




ctcatgaagc 


tcgccatggt 


gaagctcttc ccccgcctcc 


gggagatggg 


ggcccgcatg 


2340 


ctcctccagg 


tcgccaacga 


gctcctcctg gaggcccccc 


aagcgcgggc 


cgaggaggtg 


2400 


gcggctttgg 


ccaaggaggc 


catggagaag gcctatcccc 


tcgccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


tgggggagga 


ctggctttcc gccaagggtc 


accaccacca 


ccaccac 


2517 



<210> 530 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 530 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
1.5 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 

100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys 7U.a Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 
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Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 

210 215 220 

Leu Lys Pro Ala He Arg Qlu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Qlu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 
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Gly Val Arg Arg Asp Val Ala Tyr lieu 61n Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Qlu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480* 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 

595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 6X5 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He His 
625 630 635 640 

Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 
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Pro Val Gin Gly Thr Ala Ala Asp 
755 760 

Leu Phe Pro Arg Leu Arg Glu Met 
770 775 

Ala Asn Glu Leu Leu Leu Glu Ala 
785 790 

Ala Ala Leu Ala Lys Glu Ala Met 
805 

Pro Leu Glu Val Glu Val Gly Met 
820 

Gly His His His His His His 
835 



Leu Met Lys Leu Ala Met Val Lys 
765 

Gly Ala Arg Met Leu Leu Gin Val 
780 

Pro Gin Ala Arg Ala Glu Glu Val 
795 800 

Glu Lys Ala Tyr Pro Leu Ala Val 
810 815 

Gly Glu Asp Trp Leu Ser Ala Lys 
825 830 



<210> 531 

<211> 54 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 



<400> 531 

cacgagttcg gccttctggg aggggagaag ccccgggagg aggccccctg gccc 54 



<210> 532 

<211> 54 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 532 

gggccagggg gcctcctccc ggggcttctc ccctcccaga aggccgaact cgtg 



<210> 533 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



<400> 533 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 
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gacgcggtga 


tcgtggtctt 


tgacgccgag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggttcacg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


ctgcaggggc 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


gcggggctaa 


aggacctcaa 


ggaggtccgg 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


gcctcgaggg 


aggggctaga 


cctcgtgccc 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


gaggacgccg 


cccaccgggc 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


caccggggta 


cggcgggacg 


tggcctacct 


tcaggccctt 


1380 


tccctggagc 


ttgcggagga 


gatccgccgc 


ctcgaggagg 


aggtcttccg 


cttggcgggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tgctctttga 


cgagcttagg 


1500 


cttcccgcct 


tgaagaagac 


gaagaagaca 


ggcaagcgct 


ccaccagcgc 


cgcggtgctg 


1560 


gaggccctac 


9ggaggccca 


ccccatcgtg 


gagaagatcc 


tccagcaccg 


ggagctcacc 


1620 


aagctcaaga 


acacctacgt 


ggaccccctc 


ccaagcctcg 


tccacccgag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggga 


ggcttagtag 


ctccgacccc 


1740 


aacctgcaga 


acatccccgt 


ccgcaccccc 


ttgggccaga 


ggatccgccg 


ggccttcgtg 


1800 


gccgaggcgg 


Sttgggcgtt 


ggtggccctg 


gactatagcc 


agatagagct 


ccgcgtcctc 


1860 
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gcccacctct 


ccggggacga 


aaacctgatc 


agggtcttcc 


aggaggggaa 


ggacatccac 


1920 


acccagaccg 


caagctggat 


gttcggcgtc 


cccccggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agacggtgaa 


cttcggcgtc 


ctctacggca 


tgtccgccca 


taggctctcc 


2040 


caggagcttg 


ccatccccta 


cgaggaggcg 


gtggccttta 


tagagcgcta 


cttccaaagc 


2100 


ttccccaagg 


tgcgggcctg 


gatagaaaag 


accctggagg 


aggggaggaa 


gcggggctac 


2160 


gtggaaaccc 


tcttcggaag 


aaggcgctac 


gtgcccgacc 


tcaacgcccg 


ggtgaagagc 


2220 


gtcagggagg 


ccgcggagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


2280 


ctcatgaagc 


tcgccatggt 


gaagctcttc 


ccccgcctcc 


gggagatggg 


ggcccgcatg 


2340 


ctcctccagg 


tcgccaacga 


gctcctcctg 


gaggcccccc 


aagcgcgggc 


cgaggaggtg 


2400 


gcggctttgg 


ccaaggaggc 


catggagaag 


gcctatcccc 


tcgccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


tgggggagga 


ctggctttcc 


gccaagggtc 


accaccacca 


ccaccac 


2517 



<210> 534 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 534 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 

Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 
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Iiys Ala Glu Lys Glu Gly Tyr Glu.Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 * 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Cys Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 

355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 ' 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 
405 410 415 



Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
420 425 430 
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Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He His 
625 630 635 640 

Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
705 710 715 720 



Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 
725 730 735 
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Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 . 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val 
785 790 795 800 

Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Gly His His His His His His 
835 



<210> 


535 


<211> 


33 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


535 



ctggccctgg ccgcctgcag gggcggccgc gtg 



<210> 


536 


<211> 


33 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


536 



cacgcggccg cccctgcagg cggccagggc cag 



33 



33 



<210> 537 

<211> 2517 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 537 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 
caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 
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cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


^^^^ Q _rjj~_ 

ggaggacggg 




gacgcggtga 


tcgtggtctt 


tgacgccgag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 




aagg ag c t gg 


tggacctcct 


ggggttcacg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


361} 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggts-cctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 




tccgacaacc 


ttcccggggt 


caagggcatc 


600 


ggggagaaga 


cggcgaggaa 


gcttctgaag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctccbcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


r\ ^ f\ 

960 


gatcttctigg 


ccct^ggccgc 


cgc c aggggc 


ggccgcgtgc 


accgggcagc 


agaccccttg 


1020 


gcggggctaa 


aggacctcaa 


ggaggtccgg 


ggcctcctcg 


ccaaggacct 


cgccgtcttg 


1080 


gcctcgaggg 


^9g9gctaga 


cctcgtgccc 


ggggacgacc 


ccatgctcct 


cgcctacctc 


1140 


ctggaccctt 


cgaacaccac 


ccccgagggg 


gtggcgcggc 


gctacggggg 


ggagtggacg 


1200 


gaggacgccg 


cccaccgggc 


cctcctctcg 


gagaggctcc 


atcggaacct 


ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc 


taccacgagg 


tggaaaagcc 


cctctcccgg 


13*20 


gtcctggccc 


atatggaggc 


caccggggta 


cggcgggacg 


tggcctacct 


tcaggccctt 


1380 


tccctggagc 


ttgcggagga 


gatccgccgc 


ctcgaggagg 


aggtcttccg 


cttggcgggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag 


ctggaaaggg 


tgctctttga 


cgagcttagg 


1500 


cttcccgcct 


tgaagaagac 


gaagaagaca 


ggcaagcgct 


ccaccagcgc 


cgcggtgctg 


1560 


gaggccctac 


gggaggccca 


ccccatcgtg 


gagaagatcc 


tccagcaccg 


ggagctcacc 


1620 


aagctcaaga 


acacctacgt 


ggaccccctc 


ccaagcctcg 


tccacccgag 


gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggga 


ggcttagtag 


ctccgacccc 


1740 


aacctgcaga 


acatccccgt 


ccgcaccccc 


ttgggccaga 


ggatccgccg 


ggccttcgtg 


1800 
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gccgaggcgg 


gttgggcgtt 


ggtggccctg 


gactatagcc 


agatagagct 


ccgcgtcctc 


1860 


gcccacctct 


ccggggacga 


aaacctgatc 


agggtcttcc 


aggaggggaa 


ggacatccac 


1920 


acccagaccg 


caagctggat 


gttcggcgtc 


cccccggagg 


ccgtggaccc 


cctgatgcgc 


1980 


cgggcggcca 


agacggtgaa 


cttcggcgtc 


ctcfcacggca 


tgtccgccca 


taggctctcc 


2040 


caggagcttg 


ccatccccta 


cgagg^ggcg 


gtggccttta 


tagagcgcta 


cttccaaagc 


2100 


ttccccaagg 


tgcgggcctg 


gatagaaaag 


accctggagg 


aggggaggaa 


gcggggctac 


2160 


gtggaaaccc 


tcttcggaag 


aaggcgctac 


gtgcccgacc 


tcaacgcccg 


ggtgaagagc 


2220 


gtcagggagg 


ccgcggagcg 


catggccttc 


aacatgcccg 


tccagggcac 


cgccgccgac 


n o A 
ZZOU 


ctcatgaagc 


tcgccatggt 


gaagctcttc 


ccccgcctcc 


gggagatggg 


ggcccgcatg 


2340 


ctcctccagg 


tcgccaacga 


gctcctcctg 


gaggcccccc 


aagcgcgggc 


cgaggaggtg 


2400 


gcggctttgg 


ccaaggaggc 


catggagaag 


gcctatcccc 


tcgccgtgcc 


cctggaggtg 


2460 


gaggtgggga 


tgggggagga 


ctggctttcc 


gccaagggtc 


accaccacca 


ccaccac 


2517 



<210> 538 

<211> 839 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 538 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 

1 5 10 • 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 ■ 60 

Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
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115 



120 



125 



Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Lys Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu 
355 360 365 

Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn 

405 410 415 



Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His 
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420 425 430 

Glu Val Glu Iiys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 ^ 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu 
450 455 460 

Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys 

500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn 
530 535 540 

Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys Asp lie His 
625 630 635 640 

Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr 

660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu 
675 680 685 

Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala 
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725 



730 



735 



Arg Val 



Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 



Pro Val 



Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 



Leu Phe 
770 



Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val 
775 780 



Ala Asn 
785 



Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val 

790 795 800 



Ala Ala 



Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val 
805 810 815 



Pro Leu 



Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 



Gly His His His His His His 
835 



<210> 


539 


<211> 


42 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


539 



ggggagaaga cggcgaggaa gcttctgaag gagtggggga gc 



<210> 


540 


<211> 


42 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


540 



gctcccccac tccttcagaa gcttcctcgc cgtcttctcc cc 42 



<210> 


541 


<211> 


2520 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


541 



atgaattcgg aggcgatgct gcccctcttt gagcccaagg gccgggtcct cctggtggac 60 
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ggccaccacc 


tggcctaccg 


caccttcttt 


gccctgaagg 


gcctcaccac 


cagccggggg 


120 


gagccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


cagagaggac 


180 


ggggacgcgg 


tgatcgtggt 


ctttgacgcc 


gaggccccct 


ccttccgcca 


cgaggcctac 


240 


ggggggtaca 


aggcgggccg 


ggcccccacg 


ccggaggact 


ttccccggca 


actcgccctc 


300 


atcaaggagc 


tggtggacct 


cctggggttc 


acgcgcctcg 


aggtcccggg 


ctacgaggcg 


360 


gacgacgtcc 


tggccaccct 


ggccaagaag 


gcggaaaagg 


agggctacga 


ggtccgcatc 


420 


ctcaccgccg 


acaaagacct 


ttaccagctc 


ctttccgacc 


gcatccacgt 


cctccacccc 


480 


gaggggtacc 


tcatcacccc 


ggcctggctt 


tgggaaaagt 


acggcctgag 


gcccgaccag 


540 


tgggccgact 


accgggccct 


gaccggggac 


gagtccgaca 


accttcccgg 


ggtcaagggc 


600 


atcggggaga 


agacggcgct 


caagcttctg 


gaggagtggg 


ggagcctgga 


agccctcctc 


660 


aagaacctgg 


accggctgaa 


gcccgccatc 


cgggagaaga 


tcctggccca 


catggacgat 


720 


ctgaagctct 


cctgggacct 


ggccaaggtg 


cgcaccgacc 


tgcccctgga 


ggtggacttc 


780 


gccaaaaggc 


gggagcccga 


ccgggagggg 


cttaaggcct 


ttctggagag 


gcttgagttt 


840 


ggcagcctcc 


tccacgagtt 


cggccttctg 


ggaggggaga 


agccccggga 


ggaggccccc 


900 


tggcccccgc 


cggaaggggc 


cttcgtgggc 


tttgtgcttt 


cccgcaagga 


gcccatgtgg 


960 


gccgatcttc 


tggccctggc 


cgcctgcagg 


ggcggccgcg 


tgcaccgggc 


agcagacccc 


1020 


ttggcggggc 


taaaggacct 


caaggaggtc 


cggggcctcc 


tcgccaagga 


cctcgccgtc 


1080 


ttggcctcga 


gggagg^gct 


agacctcgtg 


cccggggacg 


accccatgct 


cctcgcctac 


1140 


ctcctggacc 


cttcgaacac 


cacccccgag 


ggggtggcgc 


ggcgctacgg 


gggggagtgg 


1200 


acggaggacg 


ccgcccaccg 


ggccctcctc 


tcggagaggc 


tccatcggaa 


cctccttaag 


1260 


cgcctcgagg 


gggaggagaa 


gctcctttgg 


ctctaccacg 


aggtggaaaa 


gcccctctcc 


1320 


cgggtcctgg 


cccatatgga 


ggccaccggg 


gtacggcggg 


acgtggccta 


ccttcaggcc 


1380 


ctttccctgg 


agcttgcgga 


ggagatccgc 


cgcctcgagg 


aggaggtctt 


ccgcttggcg 


1440 


ggccacccct 


tcaacctcaa 


ctcccgggac 


cagctggaaa 


gggtgctctt 


tgacgagctt 


1500 


aggcttcccg 


ccttgaagaa 


gacgaagaag 


acaggcaagc 


gctccaccag 


cgccgcggtg 


1560 


ctggaggccc 


tacgggaggc 


ccaccccatc 


gtggagaaga 


tcctccagca 


ccgggagctc 


1620 


accaagctca 


agaacaccta 


cgtggacccc 


ctcccaagcc 


tcgtccaccc 


gaggacgggc 


1680 


cgcctccaca 


cccgcttcaa 


ccagacggcc 


acggccacgg 


ggaggcttag 


tagctccgac 


1740 
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cccaacctgc agaacatccc cgtccgcacc cccttgggcc agaggatccg ccgggccttc 1800 

gtggccgagg cgggttgggc gttggtggcc ctggactata gccagataga gctccgcgtc 1860 

ctcgcccacc tctccgggga cgaaaacctg atcagggtct tccaggaggg gaaggacatc 1920 

cacacccaga ccgcaagctg gatgttcggc gtccccccgg aggccgtgga ccccctgatg 1980 

cgccgggcgg ccaagacggt gaacttcggc gtcctctacg gcatgtccgc ccataggctc 2040 

tcccaggagc ttgccatccc ctacgaggag gcggtggcct ttatagagcg ctacttccaa 2100 

agcttcccca aggtgcgggc ctggatagaa aagaccctgg aggaggggag gaagcggggc 2160 

tacgtggaaa ccctcttcgg aagaaggcgc tacgtgcccg acctcaacgc ccgggtgaag 2220 

agcgtcaggg aggccgcgga gcgcatggcc ttcaacatgc fccgtccaggg caccgccgcc 2280 

gacctcatga agctcgccat ggtgaagctc ttcccccgcc tccgggagat gggggcccgc 2340 

atgctcctcc aggtcgccaa cgagctcctc ctggaggccc cccaagcgcg ggccgaggag 2400 

gtggcggctt tggccaagga ggccatggag aaggcctatc ccctcgccgt gcccctggag 2460 

gtggaggtgg ggatggggga ggactggctt tccgccaagg gtcaccacca ccaccaccac 2520 



<210> 542 

<211> 840 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 542 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Ala Val 
50 55 60 

lie Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala Tyr 
65 70 75 80 

Gly Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg 
85 90 95 

Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg 
100 105 110 
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Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu Ala 
115 120 125 

Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp 
130 IBS 140 

Lys Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro 
145 150 155 160 

Glu Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu 
165 170 175 

Arg Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser 
160 165 190 

Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu Lys 
195 200 205 

Leu Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp 
210 215 220 

Arg Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp 
225 230 235 240 

Leu Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu 
245 250 - 255 

Glu Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Gly Leu Lys 

260 265 270 

Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly 
275 280 285 

Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro Pro Pro 
290 295 300 

Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp 
305 310 315 320 

Ala Asp Leu Leu Ala Leu Ala Ala Cys Arg Gly Gly Arg Val His Arg 
325 330 335 

Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly 
340 345 350 

Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp 
355 360 365 

Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro 
370 375 380 

Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp 
385 390 395 400 



Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg 
405 410 415. 
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Asn Leu Leu Lys Arg Leu Glu Gly Glu Qlu Lys Leu Leu Trp Leu Tyr 
420 425 430 

His Glu Val Glu Lys Pro Leu Set Arg Val Leu Ala His Met Glu Ala 
435 440 445 

Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu 
450 455 460 

Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala 
465 470 475 480 

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 
485 490 495 

Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly 
500 505 510 

Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His 
515 520 525 

Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys 
530 535 540 

Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly 
545 550 555 560 

Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu 
565 570 . 575 

Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr Pro Leu 
580 585 590 

Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu 
595 600 605 

Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu 
610 615 620 

Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys Asp lie 
625 630 635 640 

His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val 
645 650 655 

Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu 

660 665 670 

Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr 
675 680 685 

Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys 
690 695 700 



Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly 
705 710 715 720 
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Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn 
725 730 735 

Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
740 745 750 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val 
755 760 765 

Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin 
770 775 780 

Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu 
785 790 795 800 

Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala 
805 810 815 

Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala 
820 825 830 

Lys Gly His His His His His His 
835 840 

<2aO> 543 

<211> 2520 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 543 



atgaattcgg 


aggcgatgct 


gcccctcttt 


gagcccaagg 


gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttt 


gccctgaagg 


gcctcaccac 


cagccggggg 


120 


gagccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


cagagaggac 


180 


ggggacgcgg 


tgatcgtggt 


ctttgacgcc 


gaggccccct 


ccttccgcca 


cgaggcctac 


240 


ggggggtaca 


aggcgggccg 


ggcccccacg 


ccggaggact 


ttccccggca 


actcgccctc 


300 


atcaaggagc 


tggtggacct 


cctggggttc 


acgcgcctcg 


aggtcccggg 


ctacgaggcg 


360 


gacgacgtcc 


tggccaccct 


ggccaagaag 


gcggaaaagg 


agggctacga 


ggtccgcatc 


420 


ctcaccgccg 


acaaagacct 


ttaccagctc 


ctttccgacc 


gcatccacgt 


cctccacccc 


480 


gaggggtacc 


tcatcacccc 


ggcctggctt 


tgggaaaagt 


acggcctgag 


gcccgaccag 


540 


tgggccgact 


accgggccct 


gaccggggac 


gagtccgaca 


accttcccgg 


ggtcaagggc 


600 


atcggggaga 


agacggcgct 


caagcttctg 


aaggagtggg 


ggagcctgga 


agccctcctc 


660 


aagaacctgg 


accggctgaa 


gcccgccatc 


cgggagaaga 


tcctggccca 


catggacgat 


720 
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ctgaagctct cctgggacct ggccaaggtg cgcaccgacc tgcccctgga ggtggacttc 780 

gccaaaaggc gggagcccga ccgggagggg cttaaggcct ttctggagag gcttgagttt 840 

ggcagcctcc tccacgagtt cggccttctg ggaggggaga agccccggga ggaggccccc 900 

tggcccccgc cggaaggggc cttcgtgggc tttgtgcttt cccgcaagga gcccatgtgg 960 

gccgatcttc tggccctggc cgcctgcagg ggcggccgcg tgcaccgggc agcagacccc 1020 

ttggcggggc taaaggacct caaggaggtc cggggcctcc tcgccaagga cctcgccgtc 1080 

ttggcctcga gggaggggct agacctcgtg cccggggacg accccatgct cctcgcctac 1140 

ctcctggacc cttcgaacac cacccccgag ggggtggcgc ggcgctacgg gggggagtgg 1200 

acggaggacg ccgcccaccg ggccctcctc tcggagaggc tccatcggaa cctccttaag 1260 

cgcctcgagg gggaggagaa gctcctttgg ctctaccacg aggtggaaaa gcccctctcc 1320 

cgggtcctgg cccatatgga ggccaccggg gtacggcggg acgtggccta ccttcaggcc 1380 

ctttccctgg agcttgcgga ggagatccgc cgcctcgagg aggaggtctt ccgcttggcg 1440 

ggccacccct tcaacctcaa ctcccgggac cagctggaaa gggtgctctt tgacgagctt 1500 

aggcttcccg ccttgaagaa gacgaagaag acaggcaagc gctccaccag cgccgcggtg 1560 

ctggaggccc tacgggaggc ccaccccatc gtggagaaga tcctccagca ccgggagctc 1620 

accaagctca agaacaccta cgtggacccc ctcccaagcc tcgtccaccc gaggacgggc 1680 

cgcctccaca cccgcttcaa ccagacggcc acggccacgg ggaggctta^ tagctccgac 1740 

cccaacctgc agaacatccc cgtccgcacc cccttgggcc agaggatccg ccgggccttc 1800 

gtggccgagg cgggttgggc gttggtggcc ctggactata gccagataga gctccgcgtc 1860 

ctcgcccacc tctccgggga cgaaaacctg atcagggtct tccaggaggg gaaggacatc 1920 

cacacccaga ccgcaagctg gatgttcggc gtccccccgg aggccgtgga ccccctgatg 1980 

cgccgggcgg ccaagacggt gaacttcggc gtcctctacg gcatgtccgc ccataggctc 2040 

tcccaggagc ttgccatccc ctacgaggag gcggtggcct ttatagagcg ctacttccaa 2100 

agcttcccca aggtgcgggc ctggatagaa aagaccctgg aggaggggag gaagcggggc 2160 

tacgtggaaa ccctcttcgg aagaaggcgc tacgtgcccg acctcaacgc ccgggtgaag 2220 
aggccgcgga gcgcatggcc ttcaacatgc ccgtccaggg caccgccgcc 2280 

gacctcatga agctcgccat ggtgaagctc ttcccccgcc tccgggagat gggggcccgc 2340 

atgctcctcc aggtcgccaa cgagctcctc ctggaggccc cccaagcgcg ggccgaggag 2400 

gtggcggctt tggccaagga ggccatggag aaggcctatc ccctcgccgt gcccctggag 2460 
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gtggaggtgg ggatggggga ggactggctt tccgccaagg gtcaccacca ccaccaccac 2520 



<210> 544 

<2ai> 840 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 544 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Ala Val 

50 55 60 

He Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala Tyr 
65 70 75 80 

Gly Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg 
85 90 95 

Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg 
100 105 110 

Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu Ala 

115 120 125 

Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp 
130 135 140 

Lys Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro 
145 150 155 160 

Glu Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu 
165 170 175 

Arg Pro Asp Gla Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser 
180 185 190 

Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu Lys 
195 200 205 

Leu Leu Lys Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp 
210 215 220 

Arg Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp 
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225 



230 



235 



240 



Leu Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu 
245 250 255 

Glu val Asp Phe Ala Lys Arg Arg Qlu Pro Asp Arg Glu Gly Leu Lys 
260 265 270 

Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly 
275 280 285 

Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro Pro Pro 
290 295 300 

Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp 
305 310 315, 320 

Ala Asp Leu Leu Ala Leu Ala Ala Cys Arg Gly Gly Arg Val His Arg 



325 



330 



335 



Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly 
340 345 350 

Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp 
355 360 365 

Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro 
370 375 380 

Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp 

395 400 



385 



390 



Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg 
405 410 415 

Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr 
420 425 430 

His Glu Val Qlu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala 
435 440 445 

Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu 

450 455 460 

Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala 

475 480 



465 



Gly 



470 



His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 



485 



490 



495 



Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly 
500 505 510 

Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His 
515 520 525 



Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys 
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530 



535 



540 



Asn Thr Tyx Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly 
545 550 555 560 

Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu 
565 570 575 

Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu 
580 585 590 

Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu 
595 600 605 

Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu 
610 615 620 

Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He 
625 630 635 640 

His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val 
645 650 655 

Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu 

660 665 670 

Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr 
675 680 685 

Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys 
690 695 700 

Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly 
705 710 715 720 

Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn 
725 730 735 

Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
740 745 750 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val 
755 760 765 

Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin 
770 775 780 

Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu 
785 790 795 800 

Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala 
805 810 815 

Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala 
820 825 830 



Lys Gly His His His His His His 
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835 840 

<210> 545 

<211> 50 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 545 

ggggagaaga cggcgctcag gcttctgaag gagtggggga gcctggaagc 



<210> 546 

<211> 50 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 546 ^ 
gcttccaggc tcccccactc cttcagaagc ttgagcgccg tcttctcccc 



<210> 547 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 547 

ctggtcggga cggacgccaa tgagggtgtg aag 



<210> 548 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 548 
gtccgtcccg accagaat 



<210> 549 

<211> 1008 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
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<400> 549 
atgggtgcgg 


atattggtga 


cctctttgag agggaagagg 


tcgagcttga 


gtacttctca 


60 


ggaaagaaaa 


ttgccgttga 


tgctttcaac acgctatacc 


agttcatctc 


gataataagg 


120 


cagcctgacg 


gtacgccgtt 


aaaggactca cagggcagaa 


tcacctctca 


cctttccgga 


180 


atcctataca 


gagtctccaa 


catggtcgag gtgggaatca 


ggccggtgtt 


tgtattcgac 


240 


ggagagccac 


cggagttcaa 


gaaggctgaa attgaggaga 


ggaaaaagag 


aagggctgag 


300 


gcagaggaga 


tgtggattgc 


ggctttgcag gcaggagata 


aggacgcgaa 


aaagtatgct 


360 


caggctgcag 


ggagggttga 


cgagtacatt gttgactccg 


caaagacgct 


tttaagttac 


420 


atggggattc 


cctttgtcga 


tgccccgtct gaaggagagg 


cgcaggctgc 


ttacatggca 


480 


gcaaaaggcg 


atgtggagta 


cacaggaagc caggattacg 


attctctgct 


cttcggaagc 


540 


ccgagactcg 


ccagaaatct 


cgcaataacg ggaaaaagga 


agcttcccgg 


caaaaatgtc 


600 


tatgtggatg 


taaagccgga 


gataataatt ctggaaagca 


acctcaaaag 


gctgggtttg 


660 


acgagggagc 


agctcatcga 


catagcgatt ctggtcggga 


cggacgccaa 


tgagggtgtg 


720 


aagggtgtcg 


gcgtcaagaa 


ggctttgaac tacatcaaga 


cctacggaga 


tattttcagg 


780 


gcactcaagg 


ctctgaaagt 


aaatattgac cacgtagagg 


agataaggaa 


tttcttcctg 


840 


aatcctcctg 


tgactgacga 


ctacagaata gagttcaggg 


agcctgactt 


tgagaaggcc 


900 


atcgagttcc 


tgtgcgagga 


gcacgacttc agcagggaga 


gggtcgagaa 


ggccttggag 


960 


aagctcaaag 


ctctgaagtc 


aacccaggcc acgcfctgaga 


ggtggttc 




1008 



<210> 550 

<211> 336 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 550 

Met Gly Ala Asp lie Gly Asp Leu Phe Glu Arg Glu Glu Val Glu Leu 
15 10 15 

Glu Tyr Phe Ser Gly Lys Lys lie Ala Val Asp Ala Phe Asn Thr Leu 
20 25 30 

Tyr Gin Phe lie Ser lie lie Arg Gin Pro Asp Gly Thr Pro Leu Lys 
35 40 45 

Asp Ser Gin Gly Arg He Thr Ser His Leu Ser Gly He Leu Tyr Arg 
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50 55 60 

Val Ser Asn Met Val Qlu Val Gly He Arg Pro Val Phe Val Phe Asp 
65 70 75 80 

Gly Glu Pro Pro Glu Phe Lys Lys Ala Glu He Glu Glu Arg Lys Lys 
85 90 

Arg Arg Ala Glu Ala Glu Glu Met Trp He Ala Ala Leu Gin Ala Gly 
100 105 110 

ASP Lys Asp Ala Lys Lys Tyr Ala Gin Ala Ala Gly Arg Val Asp Glu 
lis 120 125 

Tyr He Val Asp Ser Ala Lys Thr Leu Leu Ser Tyr Met Gly He Pro 
* 140 



130 



135 



Phe val Asp Ala Pro Ser Glu Gly Glu Ala Gin Ala Ala Tyr Met Ala 
145 150 155 160 

Ala Lys Gly Asp Val Glu Tyr Thr Gly Ser Gin Asp Tyr Asp Ser Leu 



165 



170 175 



Leu Phe Gly Ser Pro Arg Leu Ala Arg Asn Leu Ala He Thr Gly Lys 
180 185 190 

Arg Lys Leu Pro Gly Lys Asn Val Tyr Val Asp Val Lys Pro Qlu He 
195 20D 205 

lie He Leu Glu Ser Asn Leu Lys Arg Leu Gly Leu Thr Arg Glu Gin 
210 215 220 

Leu He ASP He Ala He Leu Val Gly Thr Asp Ala Asn Glu Gly Val 
225 230 235 240 

Lys Gly val Gly Val Lys Lys Ala Leu Asn Tyr He Lys Thr Tyr Gly 
245 250 255 

Asp He Phe Arg Ala Leu Lys Ala Leu Lys Val Asn He Asp His Val 
260 265 270 

Qlu Glu He Arg Asn Phe Phe Leu Asn Pro Pro Val Thr Asp Asp Tyr 
275 280 285 

Arg He Glu Phe Arg Glu Pro Asp Phe Glu Lys Ala He Glu Phe Leu 
290 295 300 

Cys Glu Glu His ASP Phe Ser Arg Glu Arg Val Glu Lys Ala Leu Glu 
305 310 315 320 

Lys Leu Lys Ala Leu Lys Ser Thr Gin Ala Thr Leu Qlu Arg Trp Phe 



325 330 



<210> 551 

<211> 33 

<212> DNA 

<213> Artificial 
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<220> 

<223> Synthetic 
<400> 55X 



ctggtcggga 


cggacaggaa 


tgagggtgtg aag 




33 


<2a0> 552 
<211> 1008 
<212> DNA 
<213> Artificial 








<220> 

<223> Synthetic 








<4UU> 

atgggtgcgg 


atattggtga 


cctctttgag agggaagagg 


tcgagcttga gtacttctca 


60 


ggaaagaaaa 


ttgccgttga 


tgctttcaac acgctatacc 


agttcatctc gataataagg 


120 


cagcctgacg 


gtacgccgtt 


aaaggactca cagggcagaa 


tcacctctca cctttccgga 


180 


atcctataca 


gagtctccaa 


catggtcgag gtgggaatca 


ggccggtgtt tgtattcgac 


240 


ggagagccac 


cggagttcaa 


gaaggctgaa attgaggaga 


ggaaaaagag aagggctgag 


300 


gcagaggaga 


tgtggattgc 


ggctttgcag gcaggagata 


aggacgcgaa aaagtatgct 


360 


caggctgcag 


ggagggttga 


cgagtacatt gttgactccg 


caaagacgct tttaagttac 


420 


atggggattc 


cctttgtcga 


tgccccgtct gaaggagagg 


cgcaggctgc ttacatggca 


480 


gcaaaaggcg 


atgtggagta 


cacaggaagc caggattacg 


attctctgct cttcggaagc 


540 


ccgagactcg 


ccagaaatct 


cgcaataacg ggaaaaagga 


agcttcccgg caaaaatgtc 


600 


tatgtggatg 


taaagccgga 


gataataatt ctggaaagca 


acctcaaaag gctgggtttg 


660 


acgagggagc 


agctcatcga 


catagcgatt ctggtcggga 


cggacaggaa tgagggtgtg 


720 


aagggtgtcg 


gcgtcaagaa 


ggctttgaac tacatcaaga 


cctacggaga tattttcagg 


780 


gcactcaagg 


ctctgaaagt 


aaatattgac cacgtagagg 


agataaggaa tttcttcctg 


840 


aatcctcctg 


tgactgacga 


ctacagaata gagttcaggg 


agcctgactt tgagaaggcc 


900 


atcgagttcc 


tgtgcgagga 


gcacgacttc agcagggaga 


gggtcgagaa ggccttggag 


960 


aagctcaaag 


ctctgaagtc 


aacccaggcc acgcttgaga 


ggtggttc 


1008 



<210> 553 

<211> 336 

<212> PRT 

<213> TUrtif icial 
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<220> 

<223> Synthetic 
<400> 553 

Met Gly Ala Asp lie Gly Asp Leu Phe Glu Arg Glu Glu Val Qlu Leu 
15 10 " 

Glu Tyr Phe Ser Gly Lys Lys He Ala Val Asp Ala Phe Asn Thr Leu 
20 25 30 

Tyr Gin Phe He Ser He He Arg Gin Pro Asp Gly Thr Pro Leu Lys 
35 40 45 

Asp Ser Gin Gly Arg He Thr Ser His Leu Ser Gly He Leu Tyr Arg 
50 55 60 

Val Ser Asn Met Val Glu Val Gly He Arg Pro Val Phe Val Phe Asp 
65 70 75 80 

Gly Qlu Pro Pro Glu Phe Lys Lys Ala Glu He Glu Glu Arg Lys Lys 
85 90 95 

Arg Arg Ala Glu Ala Glu Glu Met Trp He Ala Ala Leu Gin Ala Gly 
100 105 110 

ASP Lys Asp Ala Lys Lys Tyr Ala Gin Ala Ala Gly Arg Val Asp Glu 
115 120 125 

Tyr He Val Asp Ser Ala Lys Thr Leu Leu Ser Tyr Met Gly He Pro 
130 135 140 

Phe Val Asp Ala Pro Ser Glu Gly Glu Ala Gin Ala Ala Tyr Met Ala 
145 150 155 160 

Ala Lys Gly Asp Val Glu Tyr Thr Gly Ser Gin Asp Tyr Asp Ser Leu 
165 170 175 

Leu Phe Gly Ser Pro Arg Leu Ala Arg Asn Leu Ala He Thr Gly Lys 
180 185 190 

Arg Lys Leu Pro Gly Lys Asn Val Tyr Val Asp Val Lys Pro Qlu He 
195 200 205 

He He Leu Qlu Ser Asn Leu Lys Arg Leu Gly Leu Thr Arg Glu Gin 
210 215 220 

Leu He Asp He Ala He Leu Val Gly Thr Asp Arg Asn Glu Gly Val 
225 230 235 240 

Lys Gly Val Gly Val Lys Lys Ala Leu Asn Tyr He Lys Thr Tyr Gly 
245 250 255 

Asp He Phe Arg Ala Leu Lys Ala Leu Lys Val Asn He Asp His Val 
260 265 270 

Glu Glu He Arg Asn Phe Phe Leu Asn Pro Pro Val Thr Asp Asp Tyr 
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275 280 285 

Arg He Glu Phe Arg Glu Pro Asp Phe Glu Lys Ala He Glu Phe Leu 
290 295 300 

Cys Glu Glu His Asp Phe Ser Arg Glu Arg Val Glu Lys Ala Leu Glu 
305 310 315 320 

Lys Leu Lys Ala Leu Lys Ser Thr Gin Ala Thr Leu Glu Arg Trp Phe 
325 330 335 

<210> 554 
<211> 20 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 554 

gaaccacctc tcaagcgtgg 20 



<210> 555 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 555 

acgcttgaga ggtggttcct ggaggaggcc ccctgg 36 



<210> 556 
<211> 1011 
<212> DNA 

<213> Archaeoglobus fulgidus 
<400> 556 

atgggtgcgg atattggtga cctctttgag agggaagagg tcgagcttga gtacttctca 60 
ggaaagaaaa ttgccgttga tgctttcaac acgctatacc agttcatctc gataataagg 120 
cagcctgacg gtacgccgtt aaaggactca cagggcagaa tcacctctca cctttccgga 180 
atcctataca gagtctccaa catggtcgag gtgggaatca ggccggtgtt tgtattcgac 240 
ggagagccac cggagttcaa gaaggctgaa attgaggaga ggaaaaagag aagggctgag 300 
gcagaggaga tgtggattgc ggctttgcag gcaggagata aggacgcgaa aaagtatgct 360 
caggctgcag ggagggttga cgagtacatt gttgactccg caaagacgct tttaagttac 420 
atggggattc cctttgtcga tgccccgtct gaaggagagg cgcaggctgc ttacatggca 480 
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<400> 557 
taatctgtat caggctg 



<210> 558 

<211> 2526 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



<400> 558 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 



540 
600 
660 



gcaaaaggcg atgtggagta cacaggaagc caggattacg attctctgct cttcggaagc 
ccgagactcg ccagaaatct cgcaataacg ggaaaaagga agcttcccgg caaaaatgtc 
tatgtggatg taaagccgga gataataatt ctggaaagca acctcaaaag gctgggtttg 

acgagggagc agctcatcga catagcgatt ctggtcggga cggactacaa tgagggtgtg 720 

aagggtgtcg gcgtcaagaa ggctttgaac tacatcaaga cctacggaga tattttcagg 780 

gcactcaagg ctctgaaagt aaatattgac cacgtagagg agataaggaa tttcttcctg 840 

aatcctcctg tgactgacga ctacagaata gagttcaggg agcctgactt tgagaaggcc 900 

atcgagttcc tgtgcgagga gcacgacttc agcagggaga gggtcgagaa ggccttggag 960 

aagctcaaag ctctgaagtc aacccaggcc acgcttgaga ggtggttctg a 1011 

<210> 557 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



17 



ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 
gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 
gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 
tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 
ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 
gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 420 
atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 480 
cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 540 



60 
120 
180 
240 
300 
360 
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cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgtgca 


ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


ggcgggacgt 


ggcctacott 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


gagcttaggc 


ttcccgcctt 


gaagaagacg 


aagaagacag 


gcaagcgctc 


caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag 


gcttagtagc 


1740 


tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


tgggccagag 


gatccgccgg 


1800 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg 


gtggccctgg 


actatagcca 


gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca 


gggtcttcca 


ggaggggaag 


1920 


gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc 


cgtggacccc 


1980 


ctgatgcgcc 


gggcggccaa 


gacggtgaac 


ttcggcgtcc 


tctacggcat 


gtccgcccat 


2040 


aggctctccc 


aggagcttgc 


catcccctac 


gaggaggcgg 


tggcctttat 


agagcgctac 


2100 


ttccaaagct 


tccccaaggt 


gcgggcctgg 


atagaaaaga 


ccctggagga 


ggggaggaag 


2160 


cggggctacg 


tggaaaccct 


cttcggaaga 


aggcgctacg 


tgcccgacct 


caacgcccgg 


2220 
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gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 2520 



caccac 



<210> 559 

<211> 2643 

<212> Dm 

<213> Artificial 

<220> 

<223> Synthetic 



2526 



<400> 559 

atgggtgcgg atattggtga cctctttgag agggaagagg tcgagcttga gtacttctca 60 

ggaaagaaaa ttgccgttga tgctttcaac acgctatacc agttcatctc gataataagg 120 

cagcctgacg gtacgccgtt aaaggactca cagggcagaa tcacctctca cctttccgga 180 

atcctataca gagtctccaa catggtcgag gtgggaatca ggccggtgtt tgtattcgac 240 

ggagagccac cggagttcaa gaaggctgaa attgaggaga ggaaaaagag aagggctgag 300 

gcagaggaga tgtggattgc ggctttgcag gcaggagata aggacgcgaa aaagtatgct 360 

caggctgcag ggagggttga cgagtacatt gttgactccg caaagacgct tttaagttac 420 

atggggattc cctttgtcga tgccccgtct gaaggagagg cgcaggctgc ttacatggca 480 

gcaaaaggcg atgtggagta cacaggaagc caggattacg attctctgct cttcggaagc 540 

ccgagactcg ccagaaatct cgcaataacg ggaaaaagga agcttcccgg caaaaatgtc 600 

tatgtggatg taaagccgga gataataatt ctggaaagca acctcaaaag gctgggtttg 660 

acgagggagc agctcatcga catagcgatt ctggtcggga cggactacaa tgagggtgtg 720 

aagggtgtcg gcgtcaagaa ggctttgaac tacatcaaga cctacggaga tattttcagg 780 

gcactcaagg ctctgaaagt aaatattgac cacgtagagg agataaggaa tttcttcctg 840 

aatcctcctg tgactgacga ctacagaata gagttcaggg agcctgactt tgagaaggcc 900 

atcgagttcc tgtgcgagga gcacgacttc agcagggaga gggtcgagaa ggccttggag 960 

aagctcaaag ctctgaagtc aacccaggcc acgcttgaga ggtggttcct ggaggaggcc 1020 

ccctggcccc cgccggaagg ggccttcgtg ggcttcgtcc tctcccgccc cgagcccatg 1080 
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tgggcggagc 


ttaaagccct 


ggccgcctgc 


aggggcggcc gcgtgcaccg ggcagcagac 


1140 


cccttggcgg 


ggctaaagga 


cctcaaggag 


gtccggggcc tcctcgccaa ggacctcgcc 


1200 


gtcttggcct 


cgagggaggg 


gctagacctc 


gtgcccgggg acgaccccat gctcctcgcc 


1260 


tacctcctgg 


acccttcgaa 


caccaccccc 


gagggggtgg cgcggcgcta cgggggggag 


1320 


tggacggagg 


acgccgccca 


ccgggccctc 


ctctcggaga ggctccatcg gaacctcctt 


1380 


aagcgcctcg 


agggggagga 


gaagctcctt 


tggctctacc acgaggtgga aaagcccctc 


1440 


tcccgggtcc 


tggcccatat 


ggaggccacc 


ggggtacggc gggacgtggc ctaccttcag 


1500 


gccctttccc 


tggagcttgc 


ggaggagatc 


cgccgcctcg aggaggaggt cttccgcttg 


1560 


gcgggccacc 


ccttcaacct 


caactcccgg 


gaccagctgg aaagggtgct ctttgacgag 


1620 


cttaggcttc 


ccgccttgaa 


gaagacgaag 


aagacaggca agcgctccac cagcgccgcg 


1680 


gtgctggagg 


ccctacggga 


ggcccacccc 


atcgtggaga agatcctcca gcaccgggag 


1740 


ctcaccaagc 


tcaagaacac 


ctacgtggac 


cccctcccaa gcctcgtcca cccgaggacg 


1800 


ggccgcctcc 


acacccgctt 


caaccagacg 


gccacggcca cggggaggct tagtagctcc 


1860 


gaccccaacc 


tgcagaacat 


ccccgtccgc 


acccccttgg gccagaggat ccgccgggcc 


1920 


ttcgtggccg 


aggcgggttg 


ggcgttggtg 


gccctggact atagccagat agagctccgc 


1980 


gtcctcgccc 


acctctccgg 


ggacgaaaac 


ctgatcaggg tcttccagga ggggaaggac 


2040 


atccacaccc 


agaccgcaag 


ctggatgttc 


ggcgtccccc cggaggccgt ggaccccctg 


2100 


atgcgccggg 


cggccaagac 


ggtgaacttc 


ggcgtcctct acggcatgtc cgcccatagg 


2160 


ctctcccagg 


agcttgccat 


cccctacgag 


gaggcggtgg cctttataga gcgctacttc 


2220 


caaagcttcc 


ccaaggtgcg 


ggcctggata 


gaaaagaccc tggaggaggg gaggaagcgg 


2280 


ggctacgtgg 


aaaccctctt 


cggaagaagg 


cgctacgtgc ccgacctcaa cgcccgggtg 


2340 


aagagcgtca 


gggaggccgc 


ggagcgcatg 


gccttcaaca tgcccgtcca gggcaccgcc 


2400 


gccgacctca 


tgaagctcgc 


catggtgaag 


ctcttccccc gcctccggga gatgggggcc 


2460 


cgcatgctcc 


tccaggtcgc 


caacgagctc 


ctcctggagg ccccccaagc gcgggccgag 


2520 


gaggtggcgg 


ctttggccaa 


ggaggccatg 


gagaaggcct at'cccctcgc cgtgcccctg 


2580 


gaggtggagg 


tggggatggg 


ggaggactgg 


ctttccgcca agggtcacca ccaccaccac 


2640 


cac 








2643 



<210> 560 
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<211> 881 
<212> PRT 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 560 

Met Gly Ala Asp lie Gly Asp Leu Phe Glu Arg Glu Glu Val Glu Leu 
15 10 15 

Glu Tyr Phe Ser Gly Lys Lys lie Ala Val Asp Ala Phe Asn Thr Leu 
20 25 30 

Tyr Gin Phe He Ser He He Arg Gin Pro Asp Gly Thr Pro Leu Lys 
35 ' 40 45 

Asp Ser Gin Gly Arg He Thr Ser His Leu Ser Gly He Leu Tyr Arg 
50 55 60 

Val Ser Asn Met Val Glu Val Gly He Arg Pro Val Phe Val Phe Asp 
65 ■ 70 75 80 

Gly Glu Pro Pro Glu Phe Lys Lys Ala Glu He Glu Glu Arg Lys Lys 
85 90 95 

Arg Arg Ala Glu Ala Glu Glu Met Trp He Ala Ala Leu Gin Ala Gly 

100 105 110 

Asp Lys Asp Ala Lys Lys Tyr Ala Gin Ala Ala Gly Arg Val Asp Glu 
115 120 125 

Tyr He Val Asp Ser Ala Lys Thr Leu Leu Ser Tyr Met Gly He Pro 
130 135 140 

Phe Val Asp Ala Pro Ser Glu Gly Glu Ala Gin Ala Ala Tyr Met Ala 
145 150 155 160 

Ala Lys Gly Asp Val Glu Tyr Thr Gly Ser Gin Asp Tyr Asp Ser Leu 
165 170 175 

Leu Phe Gly Ser Pro Arg Leu Ala Arg Asn Leu Ala He Thr Gly Lys 
180 185 190 

Arg Lys Leu Pro Gly Lys Asn Val Tyr Val Asp Val Lys Pro Glu He 
195 200 205 

He He Leu Glu Ser Asn Leu Lys Arg Leu Gly Leu Thr Arg Glu Gin 
210 215 220 

Leu He Asp He Ala He Leu Val Gly Thr Asp Tyr Asn Glu Gly Val 
225 230 235 240 

Lys Gly Val Gly Val Lys Lys Ala Leu Asn Tyr He Lys Thr Tyr Gly 
245 250 255 
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Asp He Phe Arg Ala Leu Lys Ala Leu Lys Val Asn He Asp His Val 
260 265 270 

Glu Glu He Arg Asn Phe Phe Leu Asn Pro Pro Val Thr Asp Asp Tyr 
275 280 285 

Arg He Glu Phe Arg Glu Pro Asp Phe Glu Lys Ala He Qlu Phe Leu 
290 295 300 

Cys Glu Glu His Asp Phe Ser Arg Glu Arg Val Glu Lys Ala Leu Glu 
305 310 315 320 



Lys Leu Lys Ala Leu Lys Ser Thr Gin Ala Thr Leu Glu Arg Trp Phe 
325 330 335 

Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly Ala Phe Val Gly Phe 
340 345 350 

Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu Leu Lys Ala Leu Ala 
355 360 365 



Ala Cys Arg Gly Gly Arg Val His Arg Ala Ala Asp Pro Leu Ala Gly 
370 375 380 



Leu Lys Asp Leu Lys Glu Val Arg Gly Leu Leu Ala Lys Asp Leu Ala 
385 390 395 400 

Val Leu Ala Ser Arg Glu Gly Leu Asp Leu Val Pro Gly Asp Asp Pro 
405 410 415 

Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn Thr Thr Pro Glu Gly 
420 425 430 

Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu Asp Ala Ala His Arg 
435 440 445 

Ala Leu Leu Ser Glu Arg Leu His Arg Asn Leu Leu Lys Arg Leu Glu 
450 455 460 

Gly Glu Glu Lys Leu Leu Trp Leu Tyr His Glu Val Glu Lys Pro Leu 
465 470 475 480 

Ser Arg Val Leu Ala His Met Glu Ala Thr Gly Val Arg Arg Asp Val 
485 490 495 

Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu Ala Glu Glu He Arg Arg 
500 505 510 

Leu Glu Glu Glu Val Phe Arg Leu Ala Gly His Pro Phe Asn Leu Asn 
515 520 525 

Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp Glu Leu Arg Leu Pro 
530 535 540 

Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys Arg Ser Thr Ser Ala Ala 
545 550 555 560 
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Val Leu Glu Ala Leu Arg Glu Ala His Pro He Val Glu Lys He Leu 
565 570 575 

Gin His Arg Glu Leu Thr Lys Leu Lys Asn Thr Tyr Val Asp Pro Leu 
580 585 590 

Pro Ser Leu Val His Pro Arg Thr Gly Arg Leu His Thr Arg Phe Asn 

595 600 605 

Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp Pro Asn Leu 
610 615 620 

Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin Arg He Arg Arg Ala 
625 630 635 640 

Phe Val Ala Glu Ala Gly Trp Ala Leu Val Ala Leu Asp Tyr Ser Gin 
645 650 655 

He Glu Leu Arg Val Leu Ala His Leu Ser Gly Asp Glu Asn Leu He 
660 665 670 

Arg Val Phe Gin Glu Gly Lys Asp He His Thr Gin Thr Ala Ser Trp 
675 680 685 

Met Phe Gly Val Pro Pro Glu Ala Val Asp Pro Leu Met Arg Arg Ala 
690 695 700 

Ala Lys Thr Val Asn Phe Gly Val Leu Tyr Gly Met Ser Ala His Arg 
705 710 715 720 

Leu Ser Gin Glu Leu Ala He Pro Tyr Glu Glu Ala Val Ala Phe He 
725 730 735 

Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg Ala Trp He Glu Lys 
740 745 750 

Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr Val Glu Thr Leu Phe Gly 
755 760 765 

Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala Arg Val Lys Ser Val Arg 
770 775 780 

Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro Val Gin Gly Thr Ala 
785 790 795 800 

Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu Phe Pro Arg Leu Arg 
805 810 815 

Glu Met Gly Ala Arg Met Leu Leu Gin Val Ala Asn Glu Leu Leu Leu 
820 825 830 

Glu Ala Pro Gin Ala Arg Ala Glu Glu Val Ala Ala Leu Ala Lys Glu 
835 840 845 



Ala Met Glu Lys Ala Tyr Pro Leu Ala Val Pro Leu Glu Val Glu Val 
850 855 860 
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Gly Met Gly Glu Asp Trp Leu Ser Ala Lys Gly His His His His His 
865 870 875 880 

His 



<210> 561 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 561 

ggttgacttc agagctttga g 21 



<210> 562 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 562 

aaagctctga agtcaaccct ggaggaggcc ccctgg 36 

<210> 563 

<211> 2619 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 563 

atgggtgcgg atattggtga cctctttgag agggaagagg tcgagcttga gtacttctca 60 

ggaaagaaaa ttgccgttga tgctttcaac acgctatacc agttcatctc gataataagg 120 

cagcctgacg gtacgccgtt aaaggactca cagggcagaa tcacctctca cctttccgga 180 

atcctataca gagtctccaa catggtcgag gtgggaatca ggccggtgtt tgtattcgac 240 

ggagagccac cggagttcaa gaaggctgaa attgaggaga ggaaaaagag aagggctgag 300 

gcagaggaga tgtggattgc ggctttgcag gcaggagata aggacgcgaa aaagtatgct 360 

caggctgcag ggagggttga cgagtacatt gttgactccg caaagacgct tttaagttac 420 

atggggattc cctttgtcga tgccccgtct gaaggagagg cgcaggctgc ttacatggca 480 

gcaaaaggcg atgtggagta cacaggaagc caggattacg attctctgct cttcggaagc 540 
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ccgagactcg 


ccagaaatct 


cgcaataacg 


ggaaaaagga 


agcttcccgg 


caaaaatgtc 


600 


tatgtggatg 


taaagccgga 


gataataatt 


ctggaaagca 


acctcaaaag 


gctgggtttg 


660 


acgagggagc 


agctcatcga 


catagcgatt 


ctggtcggga 


cggactacaa 


tgagggtgtg 


720 


aagggtgtcg 


gcgtcaagaa 


ggctttgaac 


tacatcaaga 


cctacggaga 


tattttcagg 


780 


gcactcaagg 


ctctgaaagt 


aaatattgac 


cacgtagagg 


agataaggaa 


tttcttcctg 


840 


aatcctcctg 


tgactgacga 


ctacagaata 


gagttcaggg 


agcctgactt 


tgagaaggcc 


900 


atcgagttcc 


tgtgcgagga 


gcacgacttc 


agcagggaga 


gggtcgagaa 


ggccttggag 


960 


aagctcaaag 


ctctgaagtc 


aaccctggag 


gaggccccct 


ggcccccgcc 


ggaaggggcc 


1020 


ttcgtgggct 


tcgtcctctc 


ccgccccgag 


cccatgtggg 


cggagcttaa 


agccctggcc 


1080 


gcctgcaggg 


gcggccgcgt 


gcaccgggca 


gcagacccct 


tggcggggct 


aaaggacctc 


1140 


aaggaggtcc 


ggggcctcct 


cgccaaggac 


ctcgccgtct 


tggcctcgag 


ggaggggcta 


1200 


gacctcgtgc 


ccggggacga 


ccccatgctc 


ctcgcctacc 


tcctggaccc 


ttcgaacacc 


1260 


acccccgagg 


gggtggcgcg 


gcgctacggg 


ggggagtgga 


cggaggacgc 


cgcccaccgg 


1320 


gccctcctct 


cggagaggct 


ccatcggaac 


ctccttaagc 


gcctcgaggg 


ggaggagaag 


1380 


ctcctttggc 


tctaccacga 


ggtggaaaag 


cccctctccc 


gggtcctggc 


ccatatggag 


1440 


gccaccgggg 


tacggcggga 


cgtggcctac 


cttcaggccc 


tttccctgga 


gcttgcggag 


1500 


gagatccgcc 


gcctcgagga 


ggaggtcttc 


cgcttggcgg 


gccacccctt 


caacctcaac 


1560 


tcccgggacc 


agctggaaag 


ggtgctcttt 


gacgagctta 


ggcttcccgc 


cttgaagaag 


1620 


acgaagaaga 


caggcaagcg 


ctccaccagc 


gccgcggtgc 


tggaggccct 


acgggaggcc 


1680 


caccccatcg 


tggagaagat 


cctccagcac 


cgggagctca 


ccaagctcaa 


gaacacctac 


1740 


gtggaccccc 


tcccaagcct 


cgtccacccg 


aggacgggcc 


gcctccacac 


ccgcttcaac 


1800 


cagacggcca 


cggccacggg 


gaggcttagt 


agctccgacc 


ccaacctgca 


gaacatcccc 


1860 


gtccgcaccc 


ccttgggcca 


gaggatccgc 


cgggccttcg 


tggccgaggc 


gggttgggcg 


1920 


ttggtggccc 


tggactatag 


ccagatagag 


ctccgcgtcc 


tcgcccacct 


ctccggggac 


1980 


gaaaacctga 


tcagggtctt 


ccaggagggg 


aaggacatcc 


acacccagac 


cgcaagctgg 


2040 


atgttcggcg 


tccccccgga 


ggccgtggac 


cccctgatgc 


gccgggcggc 


caagacggtg 


2100 


aacttcggcg 


tcctctacgg 


catgtccgcc 


cataggctct 


cccaggagct 


tgccatcccc 


2160 


tacgaggagg 


cggtggcctt 


tatagagcgc 


tacttccaaa 


gcttccccaa 


ggtgcgggcc 


2220 


tggatagaaa 


agaccctgga 


ggaggggagg 


aagcggggct 


acgtggaaac 


cctcttcgga 


2280 
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agaaggcgct 


acgtgcccga 


cctcaacgcc cgggtgaaga 


gcgtcaggga ggccgcggag 


2340 


cgcatggcct 


tcaacatgcc 


cgtccagggc accgccgccg 


acctcatgaa gctcgccatg 


2400 


gtgaagctct 


tcccccgcct 


ccgggagatg ggggcccgca 


tgctcctcca ggtcgccaac 


2460 


gagctcctcc 


tggaggcccc 


ccaagcgcgg gccgaggagg 


tggcggcttt ggccaaggag 


2520 


gccatggaga 


aggcctatcc 


cctcgccgtg cccctggagg tggaggtggg gatgggggag 


2580 


gactggcttt 


ccgccaaggg 


tcaccaccac caccaccac 




2619 



<210> 


564 


<211> 


873 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


564 


Met Gly Ala Asp He 


1 


5 


Glu Tyr Phe Ser Gly 




20 



10 15 



25 30 

Tyr Gin Phe He Ser He He Arg Gin Pro Asp Gly Thr Pro Leu Lys 
35 40 45 

Asp Ser Gin Gly Arg He Thr Ser His Leu Ser Gly He Leu Tyr Arg 
50 55 60 

Val Ser Asn Met Val Glu Val Gly He Arg Pro Val Phe Val Phe Asp 
65 70 75 80 

Gly Glu Pro Pro Glu Phe Lys Lys Ala Glu lie Glu Glu Arg Lys Lys 
85 90 95 

Arg Arg Ala Glu Ala Glu Glu Met Trp He Ala Ala Leu Gin Ala Gly 
100 105 110 

Asp Lys Asp Ala Lys Lys Tyr Ala Gin Ala Ala Gly Arg Val Asp Glu 
115 120 125 

Tyr He Val Asp Ser Ala Lys Thr Leu Leu Ser Tyr Met Gly He Pro 
130 135 140 

Phe Val Asp Ala Pro Ser Glu Gly Glu Ala Gin Ala Ala Tyr Met Ala 
145 150 155 160 

Ala Lys Gly Asp Val Glu Tyr Thr Gly Ser Gin Asp Tyr Asp Ser Leu 
165 170 175 
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Leu Phe Gly Ser Pro Arg Leu Ala Arg Asn Leu Ala He Thr Gly Lys 
180 1B5 190 

Arg Lys Leu Pro Gly Lys Asn Val Tyr Val Asp Val Lys Pro Glu He 
195 200 205 

He He Leu Glu Ser Asn Leu Lys Arg Leu Gly Leu Thr Arg Glu Gin 
210 215 220 

Leu He Asp He Ala He Leu Val Gly Thr Asp Tyr Asn Glu Gly Val 
225 230 235 240 

Lys Gly Val Gly Val Lys Lys Ala Leu Asn Tyr He Lys Thr Tyr Gly 
245 250 255 

Asp He Phe Arg Ala Leu Lys Ala Leu Lys Val Asn He Asp His Val 
260 265 270 

Glu Glu He Arg Asn Phe Phe Leu Asn Pro Pro Val Thr Asp Asp Tyr 

275 280 285 

Arg He Glu Phe Arg Glu Pro Asp Phe Glu Lys Ala He Glu Phe Leu 
290 295 300 

Cys Glu Glu His Asp Phe Ser Arg Glu Arg Val Glu Lys Ala Leu Glu 
305 310 315 320 

Lys Leu Lys Ala Leu Lys Ser Thr Leu Glu Glu Ala Pro Trp Pro Pro 
325 330 335 

Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro Met 
340 345 350 

Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val His 
355 360 365 

Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg 
370 375 380 

Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu 
385 390 395 400 

Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp 
405 410 415 

Pro ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu 
420 425 430 

Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His 
435 440 445 



Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu 
450 455 460 

Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu 
465 470 475 480 
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Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu 
485 490 495 

Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu 
500 505 510 

Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val 
515 520 525 

Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr 
530 535 540 

Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala 
545 550 555 560 

His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu 
565 570 575 

Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr 
580 585 590 

Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg 
595 600 605 

Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro - Val Arg Thr Pro 
610 615 620 

Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala 
625 630 635 640 

Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His 
645 650 655 

Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp 
660 665 670 

He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala 
675 680 685 

Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val 
690 695 700 

Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro 
705 710 715 720 

Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro 
725 730 735 

Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg 
740 745 750 

Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu 

755 760 765 



Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe 
770 775 780 
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Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met 
785 790 795 800 

Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu 
805 810 815 

Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu 

820 825 830 

Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu 
835 840 845 

Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser 



<210> 565 
<211> 2643 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 565 

atgggtgcgg atattggtga cctctttgag agggaagagg tcgagcttga gtacttctca 60 

ggaaagaaaa ttgccgttga tgctttcaac acgctatacc agttcatctc gataataagg 120 

cagcctgacg gtacgccgtt aaaggactca cagggcagaa tcacctctca cctttccgga 180 

atcctataca gagtctccaa catggtcgag gtgggaatca ggccggtgtt tgtattcgac 240 

ggagagccac cggagttcaa gaaggctgaa attgaggaga ggaaaaagag aagggctgag 300 

gcagaggaga tgtggattgc ggctttgcag gcaggagata aggacgcgaa aaagtatgct 360 

caggctgcag ggagggttga cgagtacatt gttgactccg caaagacgct tttaagttac 420 

atggggattc cctttgtcga tgccccgtct gaaggagagg cgcaggctgc ttacatggca 480 

gcaaaaggcg atgtggagta cacaggaagc caggattacg attctctgct cttcggaagc 540 

ccgagactcg ccagaaatct cgcaataacg ggaaaaagga agcttcccgg caaaaatgtc 600 

tatgtggatg taaagccgga gataataatt ctggaaagca acctcaaaag gctgggtttg 660 

acgagggagc agctcatcga catagcgatt ctggtcggga cggactacaa tgagggtgtg 720 

aagggtgtcg gcgtcaagaa ggctttgaac tacatcaaga cctacggaga tattttcagg 780 

gcactcaagg ctctgaaagt aaatattgac cacgtagagg agataaggaa tttcttcctg 840 

aatcctcctg tgactgacga ctacagaata gagttcaggg agcctgactt tgagaaggcc 900 
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atcgagttcc 


tgtgcgagga 


gcacgacttc 


agcagggaga 


gggtcgagaa 


ggccttggag 




aagctcaaag 


ctctgaagtc 


aacccaggcc 


acgcttgaga 


ggtggttcct 


ggaggaggcc 


1020 


ccctggcccc 


cgccggaagg 


ggccttcgtg 


ggcttcgtcc 


tctcccgccc 


cgagcccatg 


1080 


tgggcggagc 


ttaaagccct 


ggccgcctgc 


aggggcggcc 


gcgtccaccg 


ggcccccgag 


1140 


ccttataaag 


ccctcaggga 


cctgaaggag 


gcgcgggggc 


ttctcgccaa 


agacctgagc 


1200 


gttctggccc 


tgagggaagg 


ccttggcctc 


ccgcccggcg 


acgaccccat 


gctcctcgcc 


1260 


tacctcctgg 


acccttcgaa 


caccaccccc 


gagggggtgg 


cccggcgcta 


cggcggggag 


1320 


tggacggagg 


aggcggggga 


gcgggccgcc 


ctttccgaga 


ggctcttcgc 


caacctgctt 


1380 


aagaggcttg 


agggggagga 


gaggctcctt 


tggctttacc 


gggaggtgga 


gaggcccctt 


1440 


tccgctgtcc 


tggcccatat 


ggaggccacg 


ggggtgcgcc 


tggacgtggc 


ctatctcagg 


1500 


gccttgtccc 


tggaggtggc 


cgaggagatc 


gcccgcctcg 


aggccgaggt 


cttccgcctg 


1560 


gccggccacc 


ccttcaacct 


caactcccgg 


gaccagctgg 


aaagggtcct 


ctttgacgag 


1620 


ctagggcttc 


ccgccatcaa 


gaagacgcaa 


aagaccggca 


agcgctccac 


cagcgccgcc 


1680 


gtcctggagg 


ccctccgcga 


ggcccacccc 


atcgtggaga 


agatcctgca 


gtaccgggag 


1740 


ctcaccaagc 


tgaagagcac 


ctacattgac 


cccttgccgg 


acctcatcca 


ccccaggacg' 


1800 


ggccgcctcc 


acacccgctt 


caaccagacg 


gccacggcca 


cgggcaggct 


aagtagctcc 


1860 


gatcccaacc 


tccagaacat 


ccccgtccgc 


accccgcttg 


ggcagaggat 


ccgccgggcc 


1920 


ttcatcgccg 


aggaggggtg 


gctattggtg 


gccctggact 


atagccagat 


agagctcagg 


1980 


gtgctggccc 


acctctccgg 


cgacgagaac 


ctgatccggg 


tcttccagga 


ggggcgggac 


2040 


atccacacgg 


agaccgccag 


ctggatgttc 


ggcgtccccc 


gggaggccgt 


ggaccccctg 


2100 


atgcgccggg 


cggccaagac 


catcaacttc 


ggggtCCtct 


acggcatgtc 


ggcccaccgc 


2160 


ctctcccagg 


agctagccat 


cccttacgag 


gaggcccagg 


ccttcattga 


gcgctacttt 


2220 


cagagcttcc 


ccaaggtgcg 


ggcctggatt 


gagaagaccc 


tggaggaggg 


caggaggcgg 


2280 


gggtacgtgg 


agaccctctt 


cggccgccgc 


cgctacgtgc 


cagacctaga 


ggcccgggtg 


2340 


aagagcgcgc 


999ag9cggc 


cgagcgcaug 


gccuccaaca 


Egcccgucca 


gggca ccgc c 


o A n n 


gccgacctca 


tgaagctggc 


tatggtgaag 


ctcttcccca 


ggctggagga 


aatgggggcc 


2460 


^SSatgctcc 


ttcaggtcgc 


caacgagctg 


gtcctcgagg 


ccccaaaaga 


gagggcggag 


2520 


gccgtggccc 


ggctggccaa 


ggaggtcatg 


gagggggtgt 


atcccctggc 


cgtgcccctg 


2580 


gaggtggagg 


tggggatagg 


ggaggactgg 


ctctccgcca 


aggagcacca 


ccaccaccac 


2640 
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cac 2643 



<210> 


566 








<211> 


881 








<212> 


PRT 








<213> 


Artificial 






<220> 










<223> 


Synthetic 






<400> 


566 








Met Gly Ala 


Asp lie Gly Asp Leu Phe Glu Arg 


Glu Glu Val Glu 


Leu 


1 




5 10 


15 




Glu Tyr Phe 


Ser Gly Lys Iiys lie Ala Val Asp 


Ala Phe Asn Thr 


Leu 






20 25 


30 




Tyx Gin Phe 


lie Ser lie lie Arg Gin Pro Asp 


Gly Thr Pro Leu 


Lys 




35 


40 


45 




Asp Ser Gin 


Gly Arg lie Thr Ser His Leu Ser 


Gly He Leu Tyr 


Arg 


50 




55 


60 




Val Ser Asn 


Met val Glu Val Gly He Arg Pro 


Val Phe Val Phe 


Asp 


65 




70 75 




80 



Gly Glu Pro Pro Glu Phe Lys Lys Ala Glu He Glu Glu Arg Lys Lys 
85 90 95 

Arg Arg Ala Glu Ala Glu Glu Met Trp He Ala Ala Leu Gin Ala Gly 
100 105 110 

Asp Lys Asp Ala Lys Lys Tyr Ala Gin Ala Ala Gly Arg Val Asp Glu 
115 120 125 

Tyr He Val Asp Ser Ala Lys Thr Leu Leu Ser Tyr Met Gly He Pro 
130 135 140 

Phe Val Asp Ala Pro Ser Glu Gly Glu Ala Gin Ala Ala Tyr Met Ala 
145 150 155 160 

Ala Lys Gly Asp Val Glu Tyr Thr Gly Ser Gin Asp Tyr Asp Ser Leu 
165 170 175 

Leu Phe Gly Ser Pro Arg Leu Ala Arg Asn Leu Ala He Thr Gly Lys 
180 185 190 

Arg Lys Leu Pro Gly Lys Asn Val Tyr Val Asp Val Lys Pro Glu He 
195 200 205 

He He Leu Glu Ser Asn Leu Lys Arg Leu Gly Leu Thr Arg Glu Gin 
210 '215 220 

Leu He Asp He Ala He Leu Val Gly Thr Asp Tyr Asn Glu Gly Val 
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225 



230 



235 



240 



I*ys Gly Val Gly Val Lys Lys Ala Leu Asn Tyr He Lys Thr Tyr Gly 
245 250 255 

Asp He Phe Arg Ala Leu Lys Ala Leu Lys Val Asn He Asp His Val 
260 265 270 

Glu Glu He Arg Asn Phe Phe Leu Asn Pro Pro Val Thr Asp Asp Tyr 
275 280 285 

Arg He Glu Phe Arg Glu Pro Asp Phe Glu Lys Ala He Glu Phe Leu 
290 295 300 

Cys Glu Glu His Asp Phe Ser Arg Glu Arg Val Glu Lys Ala Leu Glu 
305 310 315 320 

Lys Leu Lys Ala Leu Lys Ser Thr Gin Ala Thr Leu Glu Arg Trp Phe 
325 330 335 

Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly Ala Phe Val Gly Phe 
340 345 350 

Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu Leu Lys Ala Leu Ala 
355 360 365 

Ala Cys Arg Gly Gly Arg Val His Arg Ala Pro Glu Pro Tyr Lys Ala 
370 375 380 

Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu Leu Ala Lys Asp Leu Ser 
385 390 395 400 

Val Leu Ala Leu Arg Glu Gly Leu Gly Leu Pro Pro Gly Asp Asp Pro 
405 410 415 

Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn Thr Thr Pro Glu Gly 
420 425 430 

Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu Glu Ala Gly Glu Arg 
435 440 445 

Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn Leu Leu Lys Arg Leu Glu 
450 455 460 

Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg Glu Val Glu Arg Pro Leu 
465 470 475 480 

Ser Ala Val Leu Ala His Met Glu Ala Thr Gly Val Arg Leu Asp Val 
485 490 495 

Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val Ala Glu Glu He Ala Arg 
500 505 510 

Leu Glu Ala Glu Val Phe Arg Leu Ala Gly His Pro Phe Asn Leu Asn 
515 520 525 



Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp Glu Leu Gly Leu Pro 
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530 



535 



540 



Ala lie Lys Lys Thr Gin Lys Thr Gly Lys Arg Ser Thr Ser Ala Ala 
545 550 555 560 

Val Leu Glu Ala Leu Arg Glu Ala His Pro lie Val Glu Lys lie Leu 
565 570 575 

Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser Thr Tyr lie Asp Pro Leu 
580 585 590 

Pro Asp Leu lie His Pro Arg Thr Gly Arg Leu His Thr Arg Phe Asn 
595 600 ' 605 

Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp Pro Asn Leu 
610 615 620 

Gin Asn lie Pro Val Arg Thr Pro Leu Gly Gin Arg lie Arg Arg Ala 
625 630 635 640 

Phe He Ala Glu Glu Gly Trp Leu Leu Val Ala Leu Asp Tyr Ser Gin 
645 650 655 

He Glu Leu Arg Val Leu Ala His Leu Ser Gly Asp Glu Asn Leu He 
660 665 670 

Arg Val Phe Gin Glu Gly Arg Asp He His Thr Glu Thr Ala Ser Trp 
675 680 685 

Met Phe Gly Val Pro Arg Glu Ala Val Asp Pro Leu Met Arg Arg Ala 
690 695 700 

Ala Lys Thr He Asn Phe Gly Val Leu Tyr Gly Met Ser Ala His Arg 
705 710 715 720 

Leu Ser Gin Glu Leu Ala He Pro Tyr Glu Glu Ala Gin Ala Phe He 
725 730 735 

Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg Ala Trp He Glu Lys 
740 745 750 

Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr Val Glu Thr Leu Phe Gly 

755 760 765 

Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala Arg Val Lys Ser Val Arg 
770 775 780 

Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro Val Gin Gly Thr Ala 
785 790 795 800 

Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu Phe Pro Arg Leu Glu 
805 810 815 

Glu Met Gly Ala Arg Met Leu Leu Gin Val Ala Asn Glu Leu Val Leu 
820 825 830 



Glu Ala Pro Lys Glu Arg Ala Glu Ala Val Ala Arg Leu Ala Lys Glu 
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835 840 845 

Val Met Glu Gly Val Tyr Pro Leu Ala Val Pro Leu Glu Val Glu Val 

850 855 860 

Gly lie Gly Glu Asp Trp Leu Ser Ala Lys Glu His His His His His 
865 870 875 880 

His 

<210> 567 

<211> 2643 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 567 



a taocit acacr 


atattQQtcra 


cctctt tgag 




tcaaacttoa 


gtacttctca 


60 


0gaaagaaaa 


ttgccgttga 


tgctttcaac 


acgctatacc 


agttcatctc 


gataataagg 


120 


caQCctQacQ 


crtaccrccortt 


aaaggactca 


caQQQcaaaa 


tcacctctca 


cctttccgga 


180 


atcctataca 


gagtctccaa 


catQCftcciaQ 


qtcicfciaatca 


QQCCQQtQtt 


tgtattcgac 


240 


ggagagccac 


cggagttcaa 


gaaggctgaa 


attgaggaga 


ggaaaaagag 


aagggctgag 


300 


gcagaggaga 


tgtggattgc 


ggctttgcag 


gcaggagata 


aggacgcgaa 


aaagtatgct 


360 


caggctgcag 


ggagggttga 


cgagtacatt 


gttgactccg 


caaagacgct 


tttaagttac 


420 


atggggattc 


cctttgtcga 


tgccccgtct 


gaaggagagg 


cgcaggctgc 


ttacatggca 


480 


gcaaaaggcg 


atgtggagta 


cacaggaagc 


caggattacg 


attctctgct 


cttcggaagc 


540 


ccgagactcg 


ccagaaatct 


cgcaataacg 


ggaaaaagga 


agcttcccgg 


caaaaatgtc 


600 


tatgtggatg 


taaagccgga 


gataataatt 


ctggaaagca 


acctcaaaag 


gctgggtttg 


660 


acgagggagc 


agctcatcga 


catagcgatt 


ctggtcggga 


cggactacaa 


tgagggtgtg 


720 


aagggtgtcg 


gcgtcaagaa 


ggctttgaac 


tacatcaaga 


cctacggaga 


tattttcagg 


780 


gcactcaagg 


ctctgaaagt 


aaatattgac 


cacgtagagg 


agataaggaa 


tttcttcctg 


840 


aatcctcctg 


tgactgacga 


ctacagaata 


gagttcaggg 


agcctgactt 


tgagaaggcc 


900 


atcgagttcc 


tgtgcgagga 


gcacgacttc 


agcagggaga 


gggtcgagaa 


ggccttggag 


960 


aagctcaaag 


ctctgaagtc 


aacccaggcc 


acgcttgaga 


ggtggttcct 


ggaggaggcc 


1020 


ccctggcccc 


cgccggaagg 


ggccttcgtg 


ggcttcgtcc 


tctcccgccc 


cgagcccatg 


1080 


tgggcggagc 


ttaaagccct 


ggccgcctgc 


aggggcggcc 


gcgtccaccg 


ggcccccgag 


1140 
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ccttataaag ccctcaggga cctgaaggag gcgcgggggc ttctcgccaa agacctgagc 1200 

gttctggccc tgagggaagg ccttggcctc ccgcccggcg acgaccccat gctcctcgcc 1260 

tacctcctgg acccttcgaa caccaccccc gagggggtgg cccggcgcta cggcggggag 1320 

tggacggagg aggcggggga gcgggccgcc ctttccgaga ggctcttcgc caacctgtgg 1380 

gggaggcttg agggggagga gaggctcctt tggctttacc gggaggtgga gaggcccctt 1440 

tccgctgtcc tggcccatat ggaggccacg ggggtgcgcc tggacgtggc ctatctcagg 1500 

gccttgtccc tggaggtggc cgaggagatc gcccgcctcg aggccgaggt cttccgcctg 1560 

gccggccacc ccttcaacct caactcccgg gaccagctgg aaagggtcct ctttgacgag 1620 

ctagggcttc ccgccatcgg caagacggag aagaccggca agcgctccac cagcgccgcc 1680 

gtcctggagg ccctccgcga ggcccacccc atcgtggaga agatcctgca gtaccgggag 1740 

ctcaccaagc tgaagagcac ctacattgac cccttgccgg acctcatcca ccccaggacg 1800 

ggccgcctcc acacccgctt caaccagacg gccacggcca cgggcaggct aagtagctcc 1860 

gatcccaacc tccagaacat ccccgtccgc accccgcttg ggcagaggat ccgccgggcc 1920 

ttcatcgccg aggaggggtg gctattggtg gccctggact atagccagat agagctcagg 1980 

gtgctggccc acctctccgg cgacgagaac ctgatccggg tcttccagga ggggcgggac 2040 

atccacacgg agaccgccag ctggatgttc ggcgtccccc gggaggccgt ggaccccctg 2100 

atgcgccggg cggccaagac catcaacttc ggggtcctct acggcatgtc ggcccaccgc 2160 

ctctcccagg agctagccat cccttacgag gaggcccagg ccttcattga gcgctacttt 2220 

cagagcttcc ccaaggtgcg ggcctggatt gagaagaccc tggaggaggg caggaggcgg 2280 

gggtacgtgg agaccctctt cggccgccgc cgctacgtgc cagacctaga ggcccgggtg 2340 

aagagcgtgc gggaggcggc cgagcgcatg gccttcaaca tgcccgtcca gggcaccgcc 2400 

gccgacctca tgaagctggc tatggtgaag ctcttcccca ggctggagga aatgggggcc 2460 

aggatgctcc ttcaggtcca caacgagctg gtcctcgagg ccccaaaaga gagggcggag 2520 

gccgtggccc ggctggccaa ggaggtcatg gagggggtgt atcccctggc cgtgcccctg 2580 

gaggtggagg tggggatagg ggaggactgg ctctccgcca aggagcacca ccaccaccac 2640 
cac 



2643 



<210> 568 
<211> 881 
<212> PRT 
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<213> Artificial 
<220> 

<223> Synthetic 
<400> 568 

Met Gly Ala Asp He Gly Asp Leu Phe Glu Arg Glu Glu Val Glu Leu 
15 10 15 

Glu Tyr Phe Ser Gly Lys Lys He Ala Val Asp Ala Phe Asn Thr Leu 
20 25 30 

Tyr Gin Phe He Ser He He Arg Gin Pro Asp Gly Thr Pro Leu Lys 
35 40 45 

Asp Ser Gin Gly Arg He Thr Ser His Leu Ser Gly He Leu Tyr Arg 
50 55 60 

Val Ser Asn Met Val Glu Val Gly He Arg Pro Val Phe Val Phe Asp 
65 70 75 80 

Gly Glu Pro Pro Glu Phe Lys Lys Ala Glu He Glu Glu Arg Lys Lys 
85 90 95 

Arg Arg Ala Glu Ala Glu Glu Met Trp He Ala Ala Leu Gin Ala Gly 
100 105 110 

Asp Lys Asp Ala Lys Lys Tyr Ala Gin Ala Ala Gly Arg Val Asp Glu 

115 120 125 

Tyr He Val Asp Ser Ala Lys Thr Leu Leu Ser Tyr Met Gly He Pro 
130 135 140 

Phe Val Asp Ala Pro Ser Glu Gly Glu Ala Gin Ala Ala Tyr Met Ala 
145 150 155 160 

Ala Lys Gly Asp Val Glu Tyr Thr Gly Ser Gin Asp Tyr Asp Ser Leu 
165 170 175 

Leu Phe Gly Ser Pro Arg Leu Ala Arg Asn Leu Ala He Thr Gly Lys 
180 185 190 

Arg Lys Leu Pro Gly Lys Asn Val Tyr Val Asp Val Lys Pro Glu He 
195 200 205 

He He Leu Glu Ser Asn Leu Lys Arg Leu Gly Leu Thr Arg Glu Gin 
210 215 220 

Leu He Asp He Ala He Leu Val Gly Thr Asp Tyr Asn Glu Gly Val 
225 230 235 240 

Lys Gly Val Gly Val Lys Lys Ala Leu Asn Tyr He Lys Thr Tyr Gly 
245 250 255 

Asp He Phe Arg Ala Leu Lys Ala Leu Lys Val Asn He Asp His Val 
260 265 270 
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Glu Glu lie Arg Asn Phe E>he Leu Asn Pro Pro Val Thr Asp Asp Tyr 
275 280 285 

Arg He Glu Phe Arg Glu Pro Asp Phe Glu Lys Ala He Glu Phe Leu 
290 295 300 

Cys Glu Glu His Asp Phe Ser Arg Glu Arg Val Glu Lys Ala Leu Glu 
305 310 315 320 

Lys Leu Lys Ala Leu Lys Ser Thr Gin Ala Thr Leu Glu Arg Trp Phe 
325 330 335 

Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly Ala Phe Val Gly Phe 
340 345 350 

Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu Leu Lys Ala Leu Ala 
355 360 365 

Ala Cys Arg Gly Gly Arg Val His Arg Ala Pro Glu Pro Tyr Lys Ala 
370 375 380 

Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu Leu Ala Lys Asp Leu Ser 
385 390 395 400 

Val Leu Ala Leu Arg Glu Gly Leu Gly Leu Pro Pro Gly Asp Asp Pro 
405 410 415 

Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn Thr Thr Pro Glu Gly 
420 425 430 

Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu Glu Ala Gly Glu Arg 
435 440 445 

Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn Leu Trp Gly Arg Leu Glu 
450 455 460 

Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg Glu Val Glu Arg Pro Leu 
465 470 475 480 

Ser Ala Val Leu Ala His Met Glu Ala Thr Gly Val Arg Leu Asp Val 
485 490 495 

Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val Ala Glu Glu He Ala Arg 
500 505 510 

Leu Glu Ala Glu Val Phe Arg Leu Ala Gly His Pro Phe Asn Leu Asn 
515 520 525 

Ser Arg Asp Glu Leu Glu Arg Val Leu Phe Asp Glu Leu Gly Leu Pro 
530 535 540 

Ala He Gly Lys Thr Glu Lys Thr Gly Lys Arg Ser Thr Ser Ala Ala 
545 550 555 560 

Val Leu Glu Ala Leu Arg Glu Ala His Pro He Val Glu Lys He Leu 
565 570 575 
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Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser Thr Tyr lie Asp Pro Leu 
580 585 590 

Pro Asp Leu lie His Pro Arg Thr Gly Arg Leu His Thr Arg Phe Asn 
595 600 605 

Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp Pro Asn Leu 
610 615 620 

Gin Asn lie Pro Val Arg Thr Pro Leu Gly Gin Arg lie Arg Arg Ala 
625 630 635 640 

Phe lie Ala Glu Glu Gly Trp Leu Leu Val Ala Leu Asp Tyr Ser Gin 
645 650 655 

lie Glu Leu Arg Val Leu Ala His Leu Ser Gly Asp Glu Asn Leu He 
660 665 670 

Arg Val Phe Gin Glu Gly Arg Asp He His Thr Glu Thr Ala Ser Trp 
675 680 685 

Met Phe Gly Val Pro Arg Glu Ala Val Asp Pro Leu Met Arg Arg Ala. 
. 690 695 700 

Ala Lys Thr He Asn Phe Gly Val Leu Tyr Gly Met Ser Ala His Arg 
705 710 715 720 

Leu Ser Gin Glu Leu Ala He Pro Tyr Glu Glu Ala Gin Ala Phe He 
725 730 735 

Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg Ala Trp He Glu Lys 
740 745 750 

Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr Val Glu Thr Leu Phe Gly 
755 760 765 

Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala Arg Val Lys Ser Val Arg 
770 775 780 

Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro Val Gin Gly Thr Ala 
785 790 795 800 

Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu Phe Pro Arg Leu Glu 
805 810 815 

Glu Met Gly Ala Arg Met Leu Leu Gin Val His Asn Glu Leu Val Leu 
820 825 830 

Glu Ala Pro Lys Glu Arg Ala Glu Ala Val Ala Arg Leu Ala Lys Glu 
835 840 845 

Val Met Glu Gly Val Tyr Pro Leu Ala Val Pro Leu Glu Val Glu Val 
850 855 860 



Gly He Gly Glu Asp Trp Leu Ser Ala Lys Glu His His His His His 
865 870 875 880 
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His 



<210> 569 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 569 

gagcggataa caatttcaca cagg 



<210> 570 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 570 

tgcccggtgc acgcggccgc ccctgcaggc 



<210> 571 

<211> 2445 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 571 



atgaattccc 


tgcccctctt tgagcccaag ggccgggtgc 


ttctggtgga cggccaccac 


60 


ctggcctacc 


gtaccttttt tgccctgaag ggcctcacca 


ccagccgcgg ggagccggtc 


120 


caggcggtgt 


acgggtttgc caagagcctt ttgaaggcgc 


taagggaaga cggggatgtg 


180 


gtgatcgtgg 


tgtttgacgc caaggccccc tccttccgcc 


accagaccta cgaggcctac 


240 


aaggcggggc 


gggctcccac ccccgaggac tttccccggc 


agcttgccct tatcaaggag 


300 


atggtggacc 


ttttgggcct ggagcgcctc gaggtgccgg 


gctttgaagc ggatgacgtc 


360 


ctggctaccc 


tggccaagaa ggcggaaaag gagggctacg 


aggtccgcat cctcaccgcc 


420 


gacaaagacc 


tttaccagct cctttccgac cgcatccacg 


tcctccaccc cgaggggtac 


480 


ctcatcaccc 


cggcctggct ttgggaaaag tacggcctga 


ggcccgacca gtgggccgac 


540 


taccgggccc 


tgaccgggga cgagtccgac aaccttcccg 


gggtcaaggg catcggggag 


600 
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aagacggcga 


ggaagcttct 


ggaggagtgg 


gggagcctgg 


aagccctcct 


caagaacctg 


660 


gaccggctga 


agcccgccat 


ccgggagaag 


atcctggccc 


acatggacga 


tctgaagctc 


720 


tcctgggacc 


tggccaaggt 


gcgcaccgac 


ctgcccctgg 


aggtggactt 


cgccaaaagg 


780 


cgggagcccg 


accgggaggg 


ggagaagccc 


cgggaggagg 


ccccctggcc 


cccgcccgaa 


840 


ggggcctt eg 


tgggcttcct 


cctttcccgc 


cccgagccca 


tgtgggcgga 


gcttaaagcc 


900 


ctggccgcct 


gcaggggcgg 


ccgcgtgcac 


cgggcagcag 


accccttggc 


Sgggctaaag 


960 


gacctcaagg 


aggtccgggg 


cctcctcgcc 


aaggacctcg 


ccgtcttggc 


ctcgagggag 


1020 


gggctagacc 


tcgtgcccgg 


ggacgacccc 


atgctcctcg 


cctacctcct 


ggacccttcg 


1080 


aacaccaccc 


ccgagggggt 


ggcgcggcgc 


tacggggggg 


agtggacgga 


ggacgccgcc 


1140 


caccgggccc 


tcctctcgga 


gaggctccat 


cggaacctcc 


ttaagcgcct 


cgagggggag 


1200 


gagaagctcc 


tttggctcta 


ccacgaggtg 


gaaaagcccc 


tctcccgggt 


cctggcccat 


1260 


atggaggcca 


ccggggtacg 


gcgggacgtg 


gcctaccttc 


aggccctttc 


cctggagctt 


1320 


gcggaggaga 


tccgccgcct 


cgaggaggag 


gtcttccgct 


tggcgggcca 


ccccttcaac 


1380 


ctcaactccc 


gggaccagct 


ggaaagggtg 


ctctttgacg 


agcttaggct 


tcccgccttg 


1440 


aagaagacga 


agaagaoagg 


caagcgctcc 


accagcgccg 


cggtgctgga 


ggccctacgg 


1500 


gaggcccacc 


ccatcgtgga 


gaagatcctc 


cagcaccggg 


agctcaccaa 


gctcaagaac 


1560 


acctacgtgg 


accccctccc 


aagcctcgtc 


cacccgagga 


cgggccgcct 


ccacacccgc 


1620 


ttcaaccaga 


cggccacggc 


cacggggagg 


cttagtagct 


ccgaccccaa 


cctgcagaac 


1680 


atccccgtcc 


gcaccccctt 


gggccagagg 


atccqccggg 


ccttcgtggc 


cqaggcqggt 


1740 


tgggcgttgg 


tggccctgga 


ctatagccag 


atagagctcc 


gcgtcctcgc 


ccacctctcc 


1800 


ggggacgaaa 


acctgatcag 


ggtcttccag 


gaggggaagg 


acatccacac 


ccagaccgca 


1860 


agctggatgt 


tcggcgtccc 


cccggaggcc 


gtggaccccc 


tgatgcgccg 


ggcggccaag 


1920 


acggtgaact 


tcggcgtcct 


ctacggcatg 


tccgcccata 


ggctctccca 


ggagcttgcc 


1980 


atcccctacg 


aggaggcggt 


ggcctttata 


gagcgctact 


tccaaagctt 


ccccaaggtg 


2040 


cgggcctgga 


tagaaaagac 


cctggaggag 


gggaggaagc 


ggggctacgt 


ggaaaccctc 


2100 


ttcggaagaa 


ggcgctacgt 


gcccgacctc 


aacgcccggg 


tgaagagcgt 


cagggaggcc 


2160 


gcggagcgca 


tggccttcaa 


catgcccgtc 


cagggcaccg 


ccgccgacct 


catgaagctc 


2220 


gccatggtga 


agctcttccc 


ccgcctccgg 


gagatggggg 


cccgcatgct 


cctccaggtc 


2280 


gccaacgagc 


tcctcctgga 


ggccccccaa 


gcgcgggccg 


aggaggtggc 


ggctttggcc 


2340 
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aaggaggcca tggagaaggc ctatcccctc gccgtgcccc tggaggtgga ggtggggatg 2400 
ggggaggact ggctttccgc caagggtcac caccaccacc accac 2445 



<210> 


572 


<211> 


815 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


572 


Met Asn Ser Leu Pro 


1 


5 



10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
20 25 30 

Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val He Val Val 
50 55 60 

Phe Asp Ala Lys Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu He Lys Glu Met Val Asp Leu Leu Gly Leu Glu Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 

115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro Asp 
165 170 175 

Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp Asn Leu. 
180 185 190 

Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu Leu Glu 
195 200 205 

Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu Lys 
210 215 220 
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Pro Ala lie Arg Glu Lys He Leu Ala His Met Asp Asp Leu Lys Leu 
225 230 235 240 

Ser Trp Asp Leu Ala Lys Val Arg'xhr Asp Leu Pro Leu Glu Val Asp 
245 250 255 

Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Gly Glu Lys Pro Arg Glu 

260 265 270 

Glu Ala Pro Trp Pro Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu 
275 280 285 

Ser Arg Pro Glu Pro Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys 
290 295 300 

Arg Gly Gly Arg Val His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys 
305 310 315 320 

Asp Leu Lys Glu Val Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu 

325 330 335 

Ala Ser Arg Glu Gly Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu 
340 345 350 

Leu Ala Tyr Leu Leu Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala 
355 360 365 

Arg Arg Tyr Gly Gly Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu 

370 375 380 

Leu Ser Glu Arg Leu His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu 
385 390 395 400 

Glu Lys Leu Leu Trp Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg 
405 410 415 

Val Leu Ala His Met Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr 
420 425 430 

Leu Gin Ala Leu Ser Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu 
435 440 445 

Glu Glu Val Phe Arg Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg 
450 455 460 

Asp Gin Leu Glu Arg Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu 
465 470 475 480 

Lys Lys Thr Lys Lys Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu 
485 490 495 

Glu Ala Leu Arg Glu Ala His Pro He Val Glu Lys He Leu Gin His 
500 505 510 



Arg Glu Leu Thr Lys Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser 
515 520 525 
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I,eu Val His Pro Arg Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr 
530 535 540 

Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn 
545 550 555 560 

He Pro Val Arg Thr Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val 
565 570 575 

Ala Glu Ala Gly Trp Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu 
580 585 590 

Leu Arg Val Leu Ala His Leu Ser Gly Asp Glu Asn Leu He Arg Val 
595 600 605 

Phe Gin Glu Gly Lys Asp He His Thr Gin Thr Ala Ser Trp Met Phe 
610 615 620 

Gly Val Pro Pro Glu Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys 
625 630 635 640 

Thr Val Asn Phe Gly Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser 
645 650 655 

Gin Glu Leu Ala He Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg 
660 665 670 

Tyr Phe Gin Ser Phe Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu 
675 680 685 

Glu Glu Gly Arg Lys Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg 
690 695 700 

Arg Tyr Val Pro Asp Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala 
705 710 715 720 

Ala Glu Arg Met Ala Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp 
725 730 735 

Leu Met Lys Leu Ala Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met 
740 745 750 

Gly Ala Arg Met Leu Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala 
755 760 765 

Pro Gin Ala Arg Ala Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met 
770 775 780 

Glu Lys Ala Tyr Pro Leu Ala Val Pro Leu Glu Val Glu Val Gly Met 
785 790 795 800 

Gly Glu Asp Trp Leu Ser Ala Lys Gly His His His His His His 
805 810 815 



<210> 573 
<211> 2520 
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<212> DNA 

<213> Artificial 

<220> 

<22 3 > Synthetic 

<400> 573 



atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


caaggaggac 


180 


ggggacgcgg 


tgatcgtggt 


ctttgacgcc 


aaggccccct 


ccttccgcca 


cgaggcctac 


240 


ggggggtaca 


aggcgggccg 


ggcccccacc 


CCCrqaqcract 


tcccccgcca 


qctcqccttq 


300 


gtcaagcqqc 


tQqtqgacct 


tCtCTQQCCtQ 


Qtccqcctcq 


aaoccccoaa 


at a.cqaaaco' 


360 


gacgacgtcc 


tgggcaccct 


ggccaagaag 


qccqaaaaoQ 




Qotacacatc 


420 


ctcaccgccg 
tcatcacccc 


accgcgacct 
ggagtggctt 


ctaccaactc 
tgggagaagt 


gtctccgacc 
atgggcttaa 


gcatccacgt 
gccttcccag 


cctccacccc 
540 


480 


tgggtggact 


accgggcctt 


ggccggggac 


ccttccgaca 


acatccccgg 


cgtgaagggc 


600 


atcggggaga 


agacggcggc 


caagctgatc 


cgggagtggg 


gaagcctgga 


aaacctcctc 


660 


aagaacctgg 


accggctgaa 


gcccgccatc 


cgggagaaga 


tcctggccca 


catggacgat 


720 


ctgaagctct 


cctgggacct 


ggccaaggtg 


cgcaccgacc 


tgcccctgga 


ggtggacttc 


780 


gccaaaaggc 


gggagcccga 


ccgggagagg 


cttagggcct 


ttctggagag 


gcttgagttt 


840 


ggcagcctcc 


tccacgagtt 


cggccttctg 


gaaagcccca 


aggccctgga 


ggaggccccc 


900 


tggcccccgc 


cggaaggggc 


cttcgtgggc 


ttcgtcctct 


cccgccccga 


gcccatgtgg 


960 


gcggagctta 


aagccctggc 


cgcctgcagg 


ggcggccgcg 


tgcaccgggc 


agcagacccc 


1020 


ttggcggggc 


taaaggacct 


caaggaggtc 


cggggcctcc 


tcgccaagga 


cctcgccgtc 


1080 


ttggcctcga 


gggaggggct 


agacctcgtg 


cccggggacg 


accccatgct 


cctcgcctac 


1140 


ctcctggacc 


cttcgaacac 


cacccccgag 


ggggtggcgc 


ggcgctacgg 


gggggagtgg 


1200 


acggaggacg 


ccgcccaccg 


ggccctcctc 


tcggagaggc 


tccatcggaa 


cctccttaag 


1260 


cgcctcgagg 


gggaggagaa 


gctcctttgg 


ctctaccacg 


aggtggaaaa 


gcccctctcc 


1320 


cgggtcctgg 


cccatatgga 


ggccaccggg 


gtacggcggg 


acgtggccta 


ccttcaggcc 


1380 


ctttccctgg 


agcttgcgga 


ggagatccgc 


cgcctcgagg 


aggaggtctt 


ccgcttggcg 


1440 


ggccacccct 


tcaacctcaa 


ctcccgggac 


cagctggaaa 


gggtgctctt 


tgacgagctt 


1500 


aggcttcccg 


ccttgaagaa 


gacgaagaag 


acaggcaagc 


gctccaccag 


cgccgcggtg 


1560 
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ctggaggccc tacgggaggc ccaccccatc gtggagaaga tcctccagca ccgggagctc 1620 

accaagctca agaacaccta cgtggacccc ctcccaagcc tcgtccaccc gaggacgggc 1680 

cgcctccaca cccgcttcaa ccagacggcc acggccacgg ggaggcttag tagctccgac 1740 

cccaacctgc agaacatccc cgtccgcacc cccttgggcc agaggatccg ccgggccttc 1800 

gtggccgagg cgggttgggc gttggtggcc ctggactata gccagataga gctccgcgtc 1860 

ctcgcccacc tctccgggga cgaaaacctg atcagggtct tccaggaggg gaaggacatc 1920 

cacacccaga ccgcaagctg gatgttcggc gtccccccgg aggccgtgga ccccctgatg 1980 

cgccgggcgg ccaagacggt gaacttcggc gtcctctacg gcatgtccgc ccataggctc 2040 

tcccaggagc ttgccatccc ctacgaggag gcggtggcct ttatagagcg ctacttccaa 2100 

agcttcccca aggtgcgggc ctggatagaa aagaccctgg aggaggggag gaagcggggc 2160 

tacgtggaaa ccctcttcgg aagaaggcgc tacgtgcccg acctcaacgc ccgggtgaag 2220 

agcgtcaggg aggccgcgga gcgcatggcc ttcaacatgc ccgtccaggg caccgccgcc 2280 

gacctcatga agctcgccat ggtgaagctc ttcccccgcc tccgggagat gggggcccgc 2340 

atgctcctcc aggtcgccaa cgagctcctc ctggaggccc cccaagcgcg ggccgaggag 2400 

gtggcggctt tggccaagga ggccatggag aaggcctatc ccctcgccgt gcccctggag 2460 

gtggaggtgg ggatggggga ggactggctt tccgccaagg gtcaccacca ccaccaccac 2520 



<210> 574 

<211> 840 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 574 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val 
50 55 60 

lie Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr 
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65 70 75 80 

Gly Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg 
85 90 95 

Gin Leu Ala Leu Val Lys Arg Leu Val Asp Leu Leu Gly Leu Val Arg 
100 105 110 

Leu Glu Ala Pro Gly Tyr Glu Ala Asp Asp Val Leu Gly Thr Leu Ala 
115 120 125 

Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp 
130 135 140 

Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg lie His Val Leu His Pro 
145 150 155 160 

Glu Gly Tyr Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu 
165 170 175 

Lys Pro Ser Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser 
180 185 190 

Asp Asn lie Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Ala Lys 
195 200 205 

Leu He Arg Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu Asp 
210 215 220 

Arg Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp 
225 230 235 240 

Leu Lys Leu Ser Trp Asp Leu Ala Lys Val TOrg Thr Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg 
260 265 270 

Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly 
275 280 285 

Leu Leu Glu Ser Pro Lys Ala Leu Glu Glu 7U.a Pro Trp Pro Pro Pro 
290 295 300 

Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro Met Trp 
305 310 315 320 

Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val His Arg 
325 330 335 

Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly 
340 345 350 

Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp 
355 360 365 

Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro 
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370 



375 



380 



Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp 

395 400 



385 



390 



Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg 
405 410 415 

Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr 
420 425 430 

His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala 
435 440 445 

Thr Gly val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu 
450 455 460 

Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala 
465 470 475 480 

Gly His pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 

495 



485 



Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly 
500 505 510 

Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His 
515 520 525 

Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys 
530 535 540 

Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly 
545 550 555 560 

Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu 
565 570 575 

ser ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu 
580 585 590 

Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu 
595 600 605 

Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu 
610 615 620 

Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He 
625 630 635 640 

His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val 
645 650 655 

Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu 
660 665 670 

Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr 
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675 680 685 

Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys 
690 695 700 

Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly 
705 710 715 720 

Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn 
725 730 735 

Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
740 745 750 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val 
755 760 765 

Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin 
770 775 780 

Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu 
785 790 795 800 

Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala 
805 810 815 

Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala 
820 825 830 

Lys Gly His His His His His His 
835 840 

<210> 575 

<211> 2445 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 575 



atgaattccc tgcccctctt 


tgagcccaag 


ggccgggtgc 


ttctggtgga cggccaccac 


60 


ctggcctacc gtaccttttt 


tgccctgaag 


ggcctcacca 


ccagccgcgg ggagccggtc 


120 


caggcggtgt acgggtttgc 


caagagcctt 


ttgaaggcgc 


taagggaaga 


cggggatgtg 


180 


gtgatcgtgg tgtttgacgc 


caaggccccc 


tccttccgcc 


accagaccta 


cgaggcctac 


240 


aaggcggggc gggctcccac 


ccccgaggac 


tttccccggc 


agcttgccct 


tatcaaggag 


300 


atggtggacc ttttgggctt 


tacccgcctc 


gaggtgccgg 


gctttgaagc 


ggatgacgtc 


360 


ctggctaccc tggccaagaa 


ggcggaaaag 


gagggctacg 


aggtccgcat 


cctcaccgcc 


420 


gacaaagacc tttaccagct 


cctttccgac 


cgcatccacg 


tcctccaccc 


cgaggggtac 


480 
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ctcatcaccc 


cggcctggct 


ttgggaaaag 


tacggcctga ggcccgacca 


gtgggccgac 


540 


taccgggccc 


tgaccgggga 


cgagtccgac 


aaccttcccg gggtcaaggg 


catcggggag 


600 


aagacggcga 


ggaagcttct 


ggaggagtgg 


gggagcctgg aagccctcct 


caagaacctg 


660 


gaccggctga 


agcccgccat 


ccgggagaag 


atcctggccc acatggacga 


tctgaagctc 


720 


tcctgggacc 


tggccaaggt 


gcgcaccgac 


ctgcccctgg aggtggactt 


cgccaaaagg 


780 


cgggagcccg 


accgggaggg 


ggagaagccc 


cgggaggagg ccccctggcc 


cccgcccgaa 


840 


ggggccttcg 


tgggcttcct 


cctttcccgc 


cccgagccca tgtgggcgga 


gcttaaagcc 


900 


ctggccgcct 


gcaggggcgg 


ccgcgtgcac 


cgggcagcag accccttggc 


ggggctaaag 


960 


gacctcaagg 


aggtccgggg 


cctcctcgcG 


aaggacctcg ccgtcttggc 


ctcgagggag 


1020 


gggctagacc 


tcgtgcccgg 


ggacgacccc 


atgctcctcg cctacctcct 


ggacccttcg 


1080 


aacaccaccc 


ccgagggggt 


ggcgcggcgc 


tacggggggg agtggacgga 


ggacgccgcc 


1140 


caccgggccc 


tcctctcgga 


gaggctccat 


cggaacctcc ttaagcgcct 


cgagggggag 


1200 


gagaagctcc 


tttggctcta 


ccacgaggtg 


gaaaagcccc tctcccgggt 


cctggcccat 


1260 


atggaggcca 


ccggggtacg 


gcgggacgtg 


gcctaccttc aggccctttc 


cctggagctt 


1320 


gcggaggaga 


tccgccgcct 


cgaggaggag 


gtcttccgct tggcgggcca 


ccccttcaac 


1380 


ctcaactccc 


gggaccagct 


ggaaagggtg 


ctctttgacg agcttaggct 


tcccgccttg 


1440 


aagaagacga 


agaagacagg 


caagcgctcc 


accagcgccg cggtgctgga 


ggccctacgg 


1500 


gaggcccacc 


ccatcgtgga 


gaagatcctc 


cagcaccggg agctcaccaa 


gctcaagaac 


1560 


acctacgtgg 


accccctccc 


aagcctcgtc 


cacccgagga cgggccgcct 


ccacacccgc 


1620 


ttcaaccaga 


cggccacggc 


cacggggagg 


cttagtagct ccgaccccaa 


cctgcagaac 


1680 


atccccgtcc 


gcaccccctt 


gggccagagg 


atccgccggg ccttcgtggc 


cgaggcgggt 


1740 


tgggcgttgg 


tggccctgga 


ctatagccag 


atagagctcc gcgtcctcgc 


ccacctctcc 


1800 


ggggacgaaa 


acctgatcag 


ggtcttccag 


gaggggaagg acatccacac 


ccagaccgca 


1860 


agctggatgt 


tcggcgtccc 


cccggaggcc 


gtggaccccc tgatgcgccg 


ggcggccaag 


1920 


acggtgaact 


tcggcgtcct 


ctacggcatg 


tccgcccata ggctctccca 


ggagcttgcc 


1980 


atcccctacg 


aggaggcggt 


ggcctttata 


gagcgctact tccaaagctt 


ccccaaggtg 


2040 


cgggcctgga 


tagaaaagac 


cctggaggag 


gggaggaagc ggggctacgt 


ggaaaccctc 


2100 


ttcggaagaa 


ggcgctacgt 


gcccgacctc 


aacgcccggg tgaagagcgt 


cagggaggcc 


2160 


gcggagcgca 


tggccttcaa 


catgcccgtc 


cagggcaccg ccgccgacct 


catgaagctc 


2220 
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gccatggtga agctcttccc ccgcctccgg gagatggggg cccgcatgct cctccaggtc 2280 

gccaacgagc tcctcctgga ggccccccaa gcgcgggccg aggaggtggc ggctttggcc 2340 

aaggaggcca tggagaaggc ctatcccctc gccgtgcccc tggaggtgga ggtggggatg 2400 

g999cLggact ggctttccgc caagggtcac caccaccacc accac 2445 



<210> 


576 


<211> 


815 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


576 


Met Asn Ser Leu Pro 


1 


5 


Asp Gly His His Leu 



10 15 



20 25 30 

Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val lie Val Val 
50 55 60 

Phe Asp Ala Lys Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu lie Lys Glu Met Val Asp Leu Leu Gly Phe Thr Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro Asp 
165 170 175 

Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp Asn Leu 
180 185 190 

Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu Leu Glu 
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195 



200 



205 



Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu Lys 
210 215 220 

Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu Lys Leu 
225 230 235 240 

Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu Val Asp 
245 250 255 

Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Gly Glu Lys Pro Arg Glu 
260 265 270 

Glu Ala Pro Trp Pro Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu 
275 280 285 

Ser Arg Pro Glu Pro Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys 
290 295 300 

Arg Gly Gly Arg Val His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys 
305 310 315 320 

Asp Leu Lys Glu Val Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu 
325 330 335 

Ala Ser Arg Glu Gly Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu 
340 345 350 

Leu Ala Tyr Leu Leu Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala 
355 360 365 

Arg Arg Tyr Gly Gly Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu 

370 375 380 

Leu Ser Glu Arg Leu His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu 
385 390 395 400 

Glu Lys Leu Leu Trp Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg 
405 410 415 

Val Leu Ala His Met Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr 
420 425 430 

Leu Gin Ala Leu Ser Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu 
435 440 445 

Glu Glu Val Phe Arg Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg 
450 455 460 

Asp Glii Leu Glu Arg Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu 
465 470 475 480 

Lys Lys Thr Lys Lys Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu 
485 490 495 

Glu Ala Leu Arg Glu Ala His Pro He Val Glu Lys He Leu Gin His 
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500 



505 



510 



Arg Glu Leu Thr Lys Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser 
515 520 525 

Leu Val His Pro Arg Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr 
530 535 540 

Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn 
545 550 555 560 

lie Pro Val Arg Thr Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val 
565 570 575 

Ala Glu Ala Gly Trp Ala Leu Val Ala Leu Asp Tyr Ser Gin lie Glu 
580 585 590 

Leu Arg Val Leu Ala His Leu Ser Gly Asp Glu Asn Leu He Arg Val 
595 600 605 

Phe Gin Glu Gly Lys Asp He His Thr Gin Thr Ala Ser Trp Met Phe 
610 615 620 

Gly Val Pro Pro Glu Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys 
625 630 635 640 

Thr Val Asn Phe Gly Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser 
645 650 655 

Gin Glu Leu Ala He Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg 
660 665 670 

Tyr Phe Gin Ser Phe Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu 
675 680 685 

Glu. Glu Gly Arg Lys Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg 

690 695 700 

Arg Tyr Val Pro Asp Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala 
705 710 715 720 

Ala Glu Arg Met Ala Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp 
725 730 735 

Leu Met Lys Leu Ala Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met 
740 745 750 

Gly Ala Arg Met Leu Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala 
755 760 765 

Pro Gin Ala Arg Ala Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met 
770 775 780 

Glu Lys Ala Tyr Pro Leu Ala Val Pro Leu Glu Val Glu Val Gly Met 
785 790 795 800 



Gly Glu Asp Trp Leu Ser Ala Lys Gly His His His His His His 
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<210> 577 
<211> 2520 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 577 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 

ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120 

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct caaggaggac 180 

ggggacgcgg tgatcgtggt ctttgacgcc aaggccccct ccttccgcca cgaggcctac 240 

ggggggtaca aggcgggccg ggcccccacc ccggaggact tcccccgcca gctcgccttg 300 

gtcaagcggc tggtggacct tctgggcttt acccgcctcg aggccccggg gtacgaggcg 360 

gacgacgtcc tgggcaccct ggccaagaag gccgaaaagg aggggtacga ggtgcgcatc 420 

ctcaccgccg accgcgacct ctaccaactc gtctccgacc gcatccacgt cctccacccc 480 

gaggggtacc tcatcacccc ggagtggctt tgggagaagt atgggcttaa gccttcccag 540 

tgggtggact accgggcctt ggccggggac ccttccgaca acatccccgg cgtgaagggc 600 

atcggggaga agacggcggc caagctgatc cgggagtggg gaagcctgga aaacctcctc 660 

aagaacctgg accggctgaa gcccgccatc cgggagaaga tcctggccca catggacgat 720 

ctgaagctct cctgggacct ggccaaggtg cgcaccgacc tgcccctgga ggtggacttc 780 

gccaaaaggc gggagcccga ccgggagagg cttagggcct ttctggagag gcttgagttt 840 

ggcagcctcc tccacgagtt cggccttctg gaaagcccca aggccctgga ggaggccccc 900 

tggcccccgc cggaaggggc cttcgtgggc ttcgtcctct cccgccccga gcccatgtgg 960 

gcggagctta aagccctggc cgcctgcagg ggcggccgcg tgcaccgggc agcagacccc 1020 

ttggcggggc taaaggacct caaggaggtc cggggcctcc tcgccaagga cctcgccgtc 1080 

ttggcctcga gggaggggct agacctcgtg cccggggacg accccatgct cctcgcctac 1140 

ctcctggacc cttcgaacac cacccccgag ggggtggcgc ggcgctacgg gggggagtgg 12 00 

acggaggacg ccgcccaccg ggccctcctc tcggagaggc tccatcggaa cctccttaag 12 60 

cgcctcgagg gggaggagaa gctcctttgg ctctaccacg aggtggaaaa gcccctctcc 1320 

cgggtcctgg cccatatgga ggccaccggg gtacggcggg acgtggccta ccttcaggcc 1380 
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ctttccctgg agcttgcgga ggagatccgc cgcctcgagg aggaggtctt ccgcttggcg 1440 

ggccacccct tcaacctcaa ctcccgggac cagctggaaa gggtgctctt tgacgagctt 1500 

aggcttcccg ccttgaagaa gacgaagaag acaggcaagc gctccaccag cgccgcggtg 1560 

ctggaggccc tacgggaggc ccaccccatc gtggagaaga tcctccagca ccgggagctc 1620 

accaagctca agaacaccta cgtggacccc ctcccaagcc tcgtccaccc gaggacgggc 1680 

cgcctccaca cccgcttcaa ccagacggcc acggccacgg ggaggcttag tagctccgac 1740 

cccaacctgc agaacatccc cgtccgcacc cccttgggcc agaggatccg ccgggccttc 1800 

gtggccgagg cgggttgggc gttggtggcc ctggactata gccagataga gctccgcgtc 1860 

ctcgcccacc tctccgggga cgaaaacctg atcagggtct tccaggaggg gaaggacatc 1920 

cacacccaga ccgcaagctg gatgttcggc gtccccccgg aggccgtgga ccccctgatg 1980 

cgccgggcgg ccaagacggt gaacttcggc gtcctctacg gcatgtccgc ccataggctc 2040 

tcccaggagc ttgccatccc ctacgaggag gcggtggcct ttatagagcg ctacttccaa 2100 

agcttcccca aggtgcgggc ctggatagaa aagaccctgg aggaggggag gaagcggggc 2160 

tacgtggaaa ccctcttcgg aagaaggcgc tacgtgcccg acctcaacgc ccgggtgaag 2220 

agcgtcaggg aggccgcgga gcgcatggcc ttcaacatgc ccgtccaggg caccgccgcc 2280 

gacctcatga agctcgccat ggtgaagctc ttcccccgcc tccgggagat gggggcccgc 2340 

atgctcctcc aggtcgccaa cgagctcctc ctggaggccc cccaagcgcg ggccgaggag 2400 

gtggcggctt tggccaagga ggccatggag aaggcctatc ccctcgccgt gcccctggag 2460 

gtggaggtgg ggatggggga ggactggctt tccgccaagg gtcaccacca ccaccaccac 2520 

<210> 578 

<211> 840 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 578 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 
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Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val 
50 55 60 

lie Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr 
65 70 75 80 

Gly Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg 
85 90 95 

Gin Leu Ala Leu Val Lys Arg Leu Val Asp Leu Leu Gly Phe Thr Arg 
100 105 110 

Leu Glu Ala Pro Gly Tyr Glu Ala Asp Asp Val Leu Gly Thr Leu Ala 
115 120 125 

Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp 
130 135 140 

Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg He His Val Leu His Pro 
145 150 155 160 

Glu Gly Tyr Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly Leu 
165 170 175 

Lys Pro Ser Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro Ser 
180 185 190 

Asp Asn He Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Ala Lys 
195 200 205 

Leu He Arg Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu Asp 
210 215 220 

Arg Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp 
225 230 235 240 

Leu Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu 
245 250 255 



Glu Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg 
260 265 270 

Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly 
275 280 285 

Leu Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro 
290 295 300 

Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro Met Trp 
305 310 315 320 

Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val His Arg 
325 330 335 

Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly 
340 345 350 
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Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp 
355 360 365 

Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro 
370 375 380 

Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp 
385 390 395 400 

Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg 
405 410 415 

Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys lieu Leu Trp Leu Tyr 
420 425 430 

His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala 
435 440 445 

Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu 
450 455 460 

Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala 
465 470 475 480 

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 
485 490 495 

Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly 

500 505 510 

Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His 
515 520 525 

Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys 
530 535 540 

Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly 
545 550 555 560 

Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu 
565 570 575 

Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu 
580 585 590 

Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu 
595 600 605 

Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu 

610 615 620 

Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys Asp He 
625 630 635 640 



His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val 
645 650 655 



620 



wo 01/90337 



PCT/USOl/17086 



Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu 
660 665 670 

Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr 
675 680 685 

Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys 
690 695 700 

Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly 
705 710 715 720 

Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn 
725 730 735 

Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
740 745 750 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val 
755 760 765 

Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin 
770 775 780 

Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu 
785 790 795 800 

Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala 
805 810 S15 

Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala 
820 825 830 

Lys Gly His His His His His His 
835 840 

<210> 579 

<211> 2445 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 579 



atgaattccc 


tgcccctctt 


tgagcccaag 


ggccgggtgc ttctggtgga cggccaccac 


60 


ctggcctacc gtaccttttt tgccctgaag 


ggcctcacca ccagccgcgg ggagccggtc 


120 


caggcggtgt 


acgggtttgc 


caagagcctt 


ttgaaggcgc taagggaaga cggggatgtg 


180 


gtgatcgtgg 


tctttgacgc 


cgaggccccc 


tccttccgcc accagaccta cgaggcctac 


240 


aaggcggggc 


gggctcccac 


ccccgaggac 


tttccccggc agcttgccct tatcaaggag 


300 


atggtggacc 


ttttgggcct 


ggagcgcctc 


gaggtgccgg gctttgaagc ggatgacgtc 


360 
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ctggctaccc 


tggccaagaa 


ggcggaaaag 


gagggctacg 


aggtccgcat 


cctcaccgcc 


420 


gacaaagacc 


tttaccagct 


cctttccgac 


cgcatccacg 


tcctccaccc 


cgaggggtac 


480 


ctcatcaccc 


cggcctggct 


ttgggaaaag 


tacggcctga 


ggcccgacca 


gtgggccgac 


540 


taccgggccc 


tgaccgggga 


cgagtccgac 


aaccttcccg 


gggtcaaggg 


catcggggag 


600 


aagacggcga 


ggaagcttct 


ggaggagtgg 


99g«^9cctgg 


aagccctcct 


caagaacctg 


660 


gaccggctga 


agcccgccat 


ccgggagaag 


atcctggccc 


acatggacga 


tctgaagctc 


720 


tcctgggacc 


tggccaaggt 


gcgcaccgac 


ctgcccctgg 


aggtggactt 


cgccaaaagg 


780 


cgggagcccg 


accgggaggg 


ggagaagccc 


cgggaggagg 


ccccctggcc 


cccgcccgaa 


840 


ggggccttcg 


tgggcttcct 


cctttcccgc 


cccgagccca 


tgtgggcgga 


gcttaaagcc 


900 


ctggccgcct 


gcaggggcgg 


ccgcgtgcac 


cgggcagcag 


accccttggc 


ggggctaaag 


960 


gacctcaagg 


aggtccgggg 


cctcctcgcc 


aaggacctcg 


ccgtcttggc 


ctcgagggag 


1020 


gggctagacc 


tcgtgcccgg 


ggacgacccc 


atgctcctcg 


qctacctcct 


ggacccttcg 


1080 


aacaccaccc 


ccgagggggt 


ggcgcggcgc 


tacggggggg 


agtggacgga 


ggacgccgcc 


1140 


caccgggccc 


tcctctcgga 


gaggctccat 


cggaacctcc 


ttaagcgcct 


cgagggggag 


1200 


gagaagctcc 


tttggctcta 


ccacgaggtg 


gaaaagcccc 


tctcccgggt 


cctggcccat 


1260 


atggaggcca 


ccggggtacg 


gcgggacgtg 


gcctaccttc 


aggccctttc 


cctggagctt 


1320 


gcggaggaga 


tccgccgcct 


cgaggaggag 


gtcttccgct 


tggcgggcca 


ccccttcaac 


1380 


ctcaactccc 


gggaccagct 


ggaaagggtg 


ctctttgacg 


agcttaggct 


tcccgccttg 


1440 


aagaagacga 


agaagacagg 


caagcgctcc 


accagcgccg 


cggtgctgga 


ggccctacgg 


1500 


gaggcccacc 


ccatcgtgga 


gaagatcctc 


cagcaccggg 


agctcaccaa 


gctcaagaac 


1560 


acctacgtgg 


accccctccc 


aagcctcgtc 


cacccgagga 


cgggccgcct 


ccacacccgc 


1620 


ttcaaccaga 


cggccacggc 


cacggggagg 


cttagtagct 


ccgaccccaa 


cctgcagaac 


1680 


atccccgtcc 


gcaccccctt 


gggccagagg 


atccgccggg 


ccttcgtggc 


cgaggcgggt 


1740 


tgggcgttgg 


tggccctgga 


ctatagccag 


atagagctcc 


gcgtcctcgc 


ccacctctcc 


1800 


ggggacgaaa 


acctgatcag 


ggtcttccag 


gaggggaagg 


acatccacac 


ccagaccgca 


1860 


agctggatgt 


tcggcgtccc 


cccggaggcc 


gtggaccccc 


tgatgcgccg 


ggcggccaag 


1920 


acggtgaact 


tcggcgtcct 


ctacggcatg 


tccgcccata 


ggctctccca 


ggagcttgcc 


1980 


atcccctacg 


aggaggcggt 


ggcctttata 


gagcgctact 


tccaaagctt 


ccccaaggtg 


2040 
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cgggcctgga tagaaaagac cctggaggag gggaggaagc ggggctacgt ggaaaccctc 2100 

ttcggaagaa ggcgctacgt gcccgacctc aacgcccggg tgaagagcgt cagggaggcc 2160 

gcggagcgca tggccttcaa catgcccgtc cagggcaccg ccgccgacct catgaagctc 2220 

gccatggtga agctcttccc ccgcctccgg gagatggggg cccgcatgct cctccaggtc 2280 

gccaacgagc tcctcctgga ggccccccaa gcgcgggccg aggaggtggc ggctttggcc 2340 

aaggaggcca tggagaaggc ctatcccctc gccgtgcccc tggaggtgga ggtggggatg 2400 

ggggaggact ggctttccgc caagggtcac caccaccacc accac 2445 



<210> 


580 


<211> 


815 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


580 


Met Asn Ser Leu Pro 


1 


5 


Asp Gly His His Leu 



10 15 



20 25 30 

Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val He Val Val 
50 55 60 

Phe Asp Ala Glu Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu He Lys Glu Met Val Asp Leu Leu Gly Leu Glu Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 

115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro Asp 
165 170 175 
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Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp Asn Leu 
180 185 190 

Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu Leu Glu 
195 200 205 

Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp hrg Leu Lys 
210 215 220 

Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu Lys Leu 
225 230 235 240 

Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu Val Asp 
245' 250 255 

Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Gly Glu Lys Pro Arg Glu 
260 265 270 

Glu Ala Pro Trp Pro Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu 
275 280 285 

Ser Arg Pro Glu Pro Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys 
290 295 300 

Arg Gly Gly Arg Val His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys 
305 310 315 320 

Asp Leu Lys Glu Val Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu 

325 330 335 

Ala Ser Arg Glu Gly Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu 
340 345 350 

Leu Ala Tyr Leu Leu Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala 
355 360 365 

Arg Arg Tyr Gly Gly Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu 
370 375 380 

Leu Ser Glu Arg Leu His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu 
385 390 395 400 

Glu Lys Leu Leu Trp Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg 
405 410 415 

Val Leu Ala His Met Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr 
420 425 430 

Leu Gin Ala Leu Ser Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu 
435 ' 440 445 

Glu Glu Val Phe Arg Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg 
450 455 460 

Asp Gin Leu Glu Arg Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu 
465 470 475 480 
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Lys Lys Thr Lys Lys Thr Gly I*ys Arg Ser Thr Ser Ala Ala Val Leu 
485 490 495 

Glu Ala Leu Arg Glu Ala His Pro He Val Glu Lys He Leu Gin His 
500 505 510 

Arg Glu Leu Thr Lys Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser 
515 520 525 

Leu Val His Pro Arg Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr 
530 535 540 

Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn 
545 550 555 560 

He Pro Val Arg Thr Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val 
565 570 575 

Ala Glu Ala Gly Trp Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu 
580 585 590 

Leu Arg Val Leu Ala His Leu Ser Gly Asp Glu Asn Leu He Arg Val 
595 600 605 

Phe Gin Glu Gly Lys Asp He His Thr Gin Thr Ala Ser Trp Met Phe 
610 615 620 

Gly Val Pro Pro Glu Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys 
625 630 635 640 

Thr Val Asn Phe Gly Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser 
645 650 655 

Gin Glu Leu Ala He Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg 
660 665 670 

Tyr Phe Gin Ser Phe Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu 
675 680 685 

Glu Glu Gly Arg Lys Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg 
690 695 700 

Arg Tyr Val Pro Asp Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala 
705 710 715 720 

Ala Glu Arg Met Ala Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp 
725 730 735 

Leu Met Lys Leu Ala Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met 
740 745 750 

Gly Ala Arg Met Leu Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala 
755 760 765 

Pro Gin Ala Arg Ala Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met 
770 775 780 



625 



wo 01/90337 



PCTAJSOl/17086 



Glu Lys Ala Tyr Pro Leu Ala Val Pro Leu Glu Val Glu Val Gly Met 
785 790 795 800 

Gly Glu Asp Trp Leu Ser Ala Lys Gly His His His His His His 





805 


810 




815 




<210> 581 
<211> 2445 
<212> DNA 
<213> Artificial 










<220> 

<223> Synthetic 










<400> 581 
atgaattccc 


tgcccctctt 


tgagcccaag ggccgggtgc 


ttctggtgga 


cggccaccac 


60 


ctggcctacc 


gtaccttttt 


tgccctgaag ggcctcacca 


ccagccgcgg ggagccggtc 


120 


caggcggtgt 


acgggtttgc 


caagagcctt ttgaaggcgc 


taagggaaga 


cggggatgtg 


180 


gtgatcgtgg 


tctttgacgc 


cgaggccccc tccttccgcc 


accagaccta 


cgaggcctac 


240 


aaggcggggc 


gggctcccac 


ccccgaggac tttccccggc 


agcttgccct 


tatcaaggag 


300 


atggtggacc 


ttttgggctt 


tacccgcctc gaggtgccgg 


gctttgaagc ggatgacgtc 


360 


ctggctaccc 


tggccaagaa 


ggcggaaaag gagggctacg 


aggtccgcat 


cctcaccgcc 


420 


gacaaagacc 


tttaccagct 


cctttccgac cgcatccacg 


tcctccaccc 


cgaggggtac 


480 


ctcatcaccc 


cggcctggct 


ttgggaaaag tacggcctga 


ggcccgacca 


gtgggccgac 


540 


taccgggccc 


tgaccgggga 


cgagtccgac aaccttcccg 


gggtcaaggg 


catcggggag 


600 


aagacggcga 


ggaagcttct 


ggaggagtgg gggagcctgg 


aagccctcct 


caagaacctg 


660 


gaccggctga 


agcccgccat 


ccgggagaag atcctggccc 


acatggacga tctgaagctc 


720 


tcctgggacc 


tggccaaggt 


gcgcaccgac ctgcccctgg 


aggtggactt 


cgccaaaagg 


780 


cgggagcccg 


accgggaggg 


ggagaagccc cgggaggagg 


ccccctggcc 


cccgcccgaa 


840 


ggggccttcg 


tgggcttcct 


cctttcccgc cccgagccca 


tgtgggcgga gcttaaagcc 


900 


ctggccgcct 


gcaggggcgg 


ccgcgtgcac cgggcagcag 


accccttggc 


ggggctaaag 


960 


gacctcaagg 


aggtccgggg 


cctcctcgcc aaggacctcg 


ccgtcttggc 


ctcgagggag 


1020 


gggctagacc 


tcgtgcccgg 


ggacgacccc atgctcctcg 


cctacctcct 


ggacccttcg 


1080 


aacaccaccc 


ccgagggggt 


ggcgcggcgc tacggggggg 


agtggacgga ggacgccgcc 


1140 


caccgggccc 


tcctctcgga 


gaggctccat cggaacctcc 


ttaagcgcct 


cgagggggag 


1200 


gagaagctcc 


tttggctcta 


ccacgaggtg gaaaagcccc 


tctcccgggt 


cctggcccat 


1260 
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atggaggcca ccggggtacg gcgggacgtg goctaccttc aggccctttc cctggagctt 1320 

gcggaggaga tccgccgcct cgaggaggag gtcttccgct tggcgggcca ccccttcaac 1380 

ctcaactccc gggaccagct ggaaagggtg ctctttgacg agcttaggct tcccgccttg 1440 

aagaagacga agaagacagg caagcgctcc accagcgccg cggtgctgga ggccctacgg 1500 

gaggcccacc ccatcgtgga gaagatcctc cagcaccggg agctcaccaa gctcaagaac 1560 

aoctacgtgg accccctccc aagcctcgtc cacccgagga cgggccgcct ccacacocgc 1620 

ttcaaccaga cggccacggc cacggggagg cttagtagct ccgaccccaa cctgcagaac 1680 

atccccgtcc gcaccccctt gggccagagg. atccgccggg ccttcgtggc cgaggcgggt 1740 

tgggcgttgg tggccctgga ctatagccag atagagctcc gcgtcctcgc ccacctctcc 1800 

ggggacgaaa acctgatcag ggtottccag gaggggaagg acatccacac coagaccgoa I860 

agctggatgt tcggcgtccc cccggaggcc gtggaccccc tgatgcgccg ggcggccaag 1920 

acggtgaact tcggcgtoct ctacggcatg tccgcccata ggctctccca ggagcttgcc 1980 

atcccctacg aggaggcggt ggcctttata gagcgctact tccaaagctt ccccaaggtg 2040 

cgggcctgga tagaaaagac cctggaggag gggaggaagc ggggctacgt ggaaaccctc 2100 

ttcggaagaa ggcgctacgt gcccgacctc aacgcccggg tgaagagcgt cagggaggcc 2160 
gcggagcgoa tggccttcaa catgcccgtc cagggcaccg ccgccgacct catgaagctc 
gccatggtga agctcttccc ccgcctccgg gagatggggg cccgcatgct cctccaggtc 

gccaacgagc tcctcctgga ggccccccaa gcgcgggccg aggaggtggc ggctttggcc 2340 
aaggaggcca tggagaaggc ctatcccctc gccgtgcccc tggaggtgga ggtggggatg 
ggggaggact ggctttccgc caagggtcac caccaccacc accac 

<210> 582 

<211> 815 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 582 

Met Asn Ser Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
1 5 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
20 25 30 



2220 
2280 



2400 
2445 
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Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val lie Val Val 
50 55 60 

Phe Asp Ala Glu Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 SO 95 

Leu lie Lys Glu Met Val Asp Leu Leu Gly Phe Thr Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro Asp 
165 170 175 

Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp Asn Leu 

180 185 190 

Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu Leu Glu 
195 200 205 

Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu Lys 
210 215 220 

Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu Lys Leu 
225 230 235 240 

Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu Val Asp 
245 250 255 

Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Gly Glu Lys Pro Arg Glu 
260 265 270 

Glu Ala Pro Trp Pro Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu 
275 280 285 

Ser Arg Pro Glu Pro Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys 

290 295 300 

Arg Gly Gly Arg Val His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys 
305 310 315 320 



Asp Leu Lys Glu Val Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu 
325 330 335 
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Ala Ser Arg Glu Gly Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu 
340 345 350 

Leu Ala Tyr Leu Leu Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala 
355 360 365 

Arg Arg -Tyr Gly Gly Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu 
370 375 380 

Leu Ser Glu Arg Leu His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu 
385 390 395 400 

Glu Lys Leu Leu Trp Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg 
405 410 415 

Val Leu Ala His Met Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr 
420 425 430 

Leu Gin Ala Leu Ser Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu 
435 440 445 

Glu Glu Val Phe Arg Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg 
450 455 460 

Asp Gin Leu Glu Arg Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu 
465 470 475 480 

Lys Lys Thr Lys Lys Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu 
485 490 495 

Glu Ala Leu Arg Glu Ala His Pro He Val Glu Lys He Leu Gin His 
500 505 510 

Arg Glu Leu Thr Lys Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser 
515 520 525 

Leu Val His Pro Arg Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr 
530 535 540 

Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn 
545 550 555 560 

He Pro Val Arg Thr Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val 
565 570 575 

Ala Glu Ala Gly Trp Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu 
580 585 590 

Leu Arg Val Leu Ala His Leu Ser Gly Asp Glu Asn Leu He Arg Val 
595 600 605 

Phe Gin Glu Gly Lys Asp He His Thr Gin Thr Ala Ser Trp Met Phe 
610 615 620 

Gly Val Pro Pro Glu Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys 
625 630 635 640 
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Thr Val Asn Phe Gly Val 
645 

Gin Glu Leu Ala lie Pro 
660 

Tyr Phe Gin Ser Phe Pro 
675 

Glu Glu Gly Arg Lys Arg 
690 

Arg Tyr Val Pro Asp Leu 
705 710 

Ala Glu Arg Met Ala Phe 
725 

Leu Met Lys Leu Ala Met 
740 

Gly Ala Arg Met Leu Leu 

755 

Pro Gin Ala Arg Ala Glu 
770 

Glu Lys Ala Tyr Pro Leu 
785 790 

Gly Glu Asp Trp Leu Ser 
805 

<210> 583 
<211> 2520 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 583 



atgaattccg aggcgatgct 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 


ggccaccacc tggcctaccg 


tacctttttt 


gccctgaagg 


gcctcaccac 


cagccggggg 


120 


gagccggtcc aggcggtgta 


cgggtttgcc 


aagagccttt 


tgaaggcgct 


aagagaagac 


180 


ggggacgcgg tgatcgtggt 


ctttgacgcc 


gaggccccct 


ccttccgcca 


cgaggcctac 


240 


ggggggtaca aggcggggcg 


ggctcccacc 


cccgaggact 


ttccccggca 


gcttgccctt 


300 


atcaaggagc tggtggacct 


cctggggttt 


acccgcctcg 


aggtccccgg 


ctacgaggcg 


360 


gacgacgttc tcgccaccct 


ggccaagaag 


gcggaaaagg 


aggggtacga 


ggtgcgcatc 


420 


ctcaccgccg acaaagacct 


ttaccagctc 


ctttccgacc 


gcatccacgt 


cctccacccc 


480 



Leu Tyr Gly Met Ser Ala His Arg Leu Ser 
650 655 

Tyr Glu Glu Ala Val Ala Phe lie Glu Arg 
665 670 

Lys Val Arg Ala Trp He Glu Lys Thr Leu 
680 685 

Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg 
695 700 

Asn Ala Arg Val Lys Ser Val Arg Glu Ala 
715 720 

Asn Met Pro Val Gin Gly Thr Ala Ala Asp 
730 735 

Val Lys Leu Phe Pro Arg Leu Arg Glu Met 
745 750 

Gin Val Ala Asn Glu Leu Leu Leu Glu Ala 
760 765 

Glu Val Ala Ala Leu Ala Lys Glu Ala Met 
775 780 

Ala Val Pro Leu Glu Val Glu Val Gly Met 
795 800 



Ala Lys Gly His His His His His His 
810 815 
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W W Wb»W W WW 


crac c t aac 1 1 

gy WW wg W W «w 


tgggaaaagt 


acQcrcctqaa 

**wggww 


gcccgaccag 


540 


tcroaccoac^ 

w jj j3>*^^^^ w 


acccraaccct 

**WWJJ*J^ w w^ w 


QaccQQCTQac 


gagtccgaca 


accttcccgg 


ggtcaagggc 


600 




a w 


caagctccbc 


aaaqacrtqaa 


aqaqcctQQa 

y3**3 WW v-^^v* 


agccctcctc 


660 


cmy oci WW vy J 


ac caac tcraa 

C4 w wy y w wy 


CXCCCaCCB.tLC 


CQQQacraaQa 
^ 3 3 3 **3 ****y 


tcct.g9ccca 


catggacgat 


720 


etaaacTctcfc 

w wy v« v« w 


cctcfcraacct 

w w *j y y w w w 


y y wwawyy ^^y 


cacaccQacc 


tQCCCCtQQa 

w ^ w w 


qqtQqacttc 

33 W33*i*i» V w 


780 


crccaaaacrcif! 
y w w CLCiCi ay y w 


cr crcr a cr c c c a a 
yyy ay w w wy a 


wuy y y ay y y y 


c 1 1 aacrcr c c ii 


w w wy y ay c^y 


qctqqaqttc 

yw wyy My www 


840 


yy way >^ w w 


w w wav*y ay w 


wyy WW ^ww wy 


craaaacrcraaa 

yyayyyyaya 


aciccccaacia 

ay wwwwyyy» 


qqaqqccccc 

yyayy wwwww 


900 




^yyddyyyyL. 


wu uwy i^yyy 


tttataettt 

^wi^ywywiviaW 


w w wy w wwyy » 


qcccatqtqq 

y w w w^» 3 J 


960 


^CC^clUC U Uw 


'-yy ccuuyy w 


wy wwuy wayy 


ygt,yyt-i^yi-y 


i..y wawwyyy w 


aacaaacccc 

c^y w ciy & w 


1020 




uaaagyacc u 


uaagg agg t c 




uwy wwaayy a 


cpt* cacccftc 

www wy w wy w 


1080 


CtygCCCCya 


gggaggggcc 


agaccucg eg 


wccygyyawy 


awwwwciwy w w 


rT't" f* erect ac 

WW w wy WW («aw 


1140 


^4 ^ M a M ^4 

CuCCuyga,CC 


ccwcgaacac 


cacccccgag 


ggggtggcgc 


yy cyctawyy 




1200 


aCyyayyaCy 


ccycccaccg 


ggcwwwwwuc 


ucgyayaggc 


uwwciuwyycici 


WW www u waay 


1260 


CyCCCCyayy 


ggyaggagaa 


y wwww tcuyg 


w tw wawwcLwy 


ay y wyy aaaei 


ywwwwL«wwww 


1320 


Cyyy tCCuyy 


cccacaugga 


ggccacwggg 


gcawygcggg 


awy uyy WW ua 


w w w wwayy w w 


1380 


cfcttccct99 


agcttgcgga 


ggagatccgc 


cgccccgagg 


aggaggtcut. 


wcy cutyy^-y 


1440 


gcfccacccct 


tcaacctcaa 


ctcccgggac 


cagccggaaa 


999^9^^^^^ 


uycLwy ay w u u 


1500 


agg^^^^^^ST 


ccucgaagaa 


yacgaagaag 


acaggcaagc 


gc^ccaccag 


cgccy wyy uy 




ctggaggccc 


4* ^ ^^^^^^^^ <^ ^^^^^t 

cacgggaggc 


ccaccccauc 


gcggagaaga 


ucccccagca 


wcggyagctc 


1 fi20 


accaagctca 


agaacaccta 


cgtggacccc 


ctcccaagcc 


ccgwccaccc 


gaggacggyc 


1 6fi0 


cgcctccaca 


cccgctbcaa 


ccagacggcc 


acggccacgg 


g93^99cttag 


uagc uCwgac 


X / *U 


cccaacctgc 


agaacatccc 


cgtccgcacc 


cccttzgggcc 


agaggacccg 


ccgggcwu^w 




gtggccgagg 




gccggtggcc 


+• a ^ a a 

c L.ygawuaua 


y w way a L> ay a 


y w w w wy wy ^ w 


1860 


ctcgcccacc 


tctccgggga 


cgaaaacctg 


atcagggtct 


tccaggaggg 


gaaggacatc 


1920 


cacacccaga 


ccgcaagctg 


gatgttcggc 


gtccccccgg 


aggccgtgga 


ccccctgatg 


1980 


cgccgggcgg 


ccaagacggt 


gaacttcggc 


gtcctctacg 


gcatgtccgc 


ccataggctc 


2040 


tcccaggagc 


ttgccatccc 


ctacgaggag 


gcggtggcct 


ttatagagcg 


ctacttccaa 


2100 


agcttcccca 


aggtgcgggc 


ctggatagaa 


aagaccctgg 


aggaggggag 


gaagcggggc 


2160 
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tacgtggaaa ccctcttcgg aagaaggcgc tacgtgcccg acctcaacgc ccgggtgaag 2220 

agcgtcaggg aggccgcgga gcgcatggcc ttcaacatgc ccgtccaggg caccgccgcc 2280 

gacctcatga agctcgccat ggtgaagctc ttcccccgcc tccgggagat gggggcccgc 2340 

atgctcctcc aggtcgccaa cgagctcctc ctggaggccc cccaagcgcg ggccgaggag 2400 

gtggcggctt tggccaagga ggccatggag aaggcctatc ccctcgccgt gcccctggag 2460 

gtggaggtgg ggatggggga ggactggctt tccgccaagg gtcaccacca ccaccaccac 2520 



<210> 584 

<211> 840 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 584 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Ala Val 
50 55 60 

lie Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala Tyr 
65 70 75 80 

Gly Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg 
85 90 95 

Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg 
100 105 • 110 

Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu Ala 
115 120 125 

Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp 
130 135 140 

Lys Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro 
145 150 155 160 

Glu Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu 
165 170 175 

Arg Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser 
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180 



185 



190 



Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala hen hys 
195 200 205 

Leu Leu Lys Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp 
210 215 220 

Arg Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp 
225 230 235 240 

Leu Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu 
245 250 255 

Glu Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Gly Leu Lys 
260 265 270 

Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly 
275 280 285 

Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro Trp Pro Pro Pro 
290 295 300 

Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp 
305 310 315 320 

Ala Asp Leu Leu Ala Leu Ala Ala Cys Arg Gly Gly Arg Val His Arg 
325 330 335 

Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly 
340 345 350 

Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp 

355 360 365 

Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro 
370 375 380 

Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp 
385 390 395 400 

Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg. Leu His Arg 
405 410 415 

Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr 
420 425 430 

His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala 
435 440 445 

Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu 
450 455 460 

Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala 
465 470 475 480 

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 
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485 490 495 

Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Qly 
500 505 510 

Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His 
515 520 525 

Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys Leu Lys 
530 535 540 

Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly 
545 550 555 560 

Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu 
565 570 575 



Ser Ser Ser Asp Pro Asn Leu Gin 
580 

Gly Gin Arg lie Arg Arg Ala Phe 
595 600 

Val Ala Leu Asp Tyr Ser Gin lie 
610 615 

Ser Gly Asp Glu Asn Leu lie Arg 
625 630 

His Thr Gin Thr Ala Ser Trp Met 
645 

Asp Pro Leu Met Arg Arg Ala Ala 
660 

Tyr Gly Met Ser Ala His Arg Leu 

675 680 



Asn lie Pro Val Arg Thr Pro Leu 
585 590 

Val Ala Glu Ala Gly Trp Ala Leu 
605 

Glu Leu Arg Val Leu Ala His Leu 
620 

Val Phe Gin Glu Gly Lys Asp lie 
635 640 

Phe Gly Val Pro Pro Glu Ala Val 
650 655 

Lys Thr Val Asn Phe Gly Val Leu 
665 670 

Ser Gin Glu Leu Ala lie Pro Tyr 

685 



Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe Pro Lys 
690 695 700 

Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly 
705 710 715 720 

Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn 
725 730 735 

Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
740 745 750 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val 
755 760 765 

Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin 
770 775 780 

Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu 
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785 790 795 800 

Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala 
805 810 815 

Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala 
820 825 830 

Lys Gly His His His His His His 
835 840 

<210> 585 

<211> 2499 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 585 



atgaattccc 


tgcccctctt 


tgagcccaag 


agccgggtgc ttctggtgga 


cggccaccac 


60 


ctggcctacc 


gcacctcctt 


cgccctgaag 


ggcotcacca ccagccgcgg 


ggagccggtg 


120 


cagatggtct 


acggcttcgc 


ccggagcctc 


ctcaaggcct tgaaggagga 


cggacaggcg 


180 


gtggtcgtgg 


tctttgacgc 


caaggccccc 


tccttccgcc acgaggccta 


cgaggcctac 


240 


aacraccraocc 


acrcrcccccac 


cccqqaaQac 

^^^33**33'^^ 


ttccccggca gctcgccctt 


atcaaggaga 


300 


tggtggacct 


tttgggcctg 


gcgcgcctcg 


aggtcccggg ctacgaggcg 


gacgacgttc 


360 


tcgccaccct 


ggccaagaag 


gcggaaaagg 


aggggtacga ggtgcgcatt 


ctaccgccga 


420 


ccgcgacctc 


taccaactcg 


tctccgaccg 


cgtcgccgtc ctccaccccg 


agggccacct 


480 


catcaccccg 


gagtggcttt 


gggagaagta 


cggcctcagg ccggagcagt 


gggtggactt 


540 


ccgcgccctc 


gtgggggacc 


cctccgacaa 


cctccccggg gtcaagggca 


tcggggagga 


600 


gacggcggcc 


aagctgatcc 


gggagtgggg 


aagcctggaa aaccttctta 


agcacctgga 


660 


acaggcgaaa 


cctgcctccg 


tgcgggagaa 


gatccttagc cacatggagg 


acctcaagct 


720 


atccctggag 


ctatcccggg 


tgcacacgga 


cttgcttctt cagtggactt 


taaggccctg 


780 


cgccgcagga 


cccccgacct 


ggagggcctg 


agggcctttt tggaggagct 


ggagttcggc 


840 


agcctcctcc 


acgagttcgg 


cctcctggag 


gcccccgccg cggcggagga 


agctccctgg 


900 


ccgccccccg 


agggagcctt 


cgtggggtac 


gttctttccc gccccgagcc 


catgtgggcg 


960 


gagcttaacg 


ccttggccgc 


cgcctggggc 


ggccgcgtgc accgggcagc 


agaccccttg 


1020 


gcggggctaa 


aggacctcaa 


ggaggtccgg 


ggcctcctcg ccaaggacct 


cgccgtcttg 


1080 


gcctcgaggg 


aggggctaga 


cctcgtgccc 


ggggacgacc ccatgctcct 


cgcctacctc 


1140 
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ctggaccctt 


cgaacaccac 


ccccgagggg gtggcgcggc gctacggggg ggagtggacg 


1200 


gaggacgccg 


cccaccgggc 


cctcctctcg gagaggctcc atcggaacct ccttaagcgc 


1260 


ctcgaggggg 


aggagaagct 


cctttggctc taccacgagg tggaaaagcc cctctcccgg 


1320 


gtcctggccc 


atatggaggc 


caccggggta cggcgggacg tggcctacct tcaggccctt 


1380 


tccctggagc 


ttgcggagga 


gatccgccgc ctcgaggagg aggtcttccg cttggcgggc 


1440 


caccccttca 


acctcaactc 


ccgggaccag ctggaaaggg tgctctttga cgagcttagg 


1500 


cttcccgcct 


tgaagaagac 


gaagaagaca ggcaagcgct ccaccagcgc cgcggtgctg 


1560 


gaggccctac 


gggaggccca 


ccccatcgtg gagaagatcc tccagcaccg ggagctcacc 


1620 


aagctcaaga 


acacctacgt 


ggaccccctc ccaagcctcg tccacccgag gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg gccacgggga ggcttagtag ctccgacccc 


1740 


aacctgcaga 


acatccccgt 


ccgcaccccc ttgggccaga ggatccgccg ggccttcgtg 


1800 


gccgaggcgg 


gttgggcgtt 


ggtggccctg gactatagcc agatagagct ccgcgtcctc 


1860 


gcccacctct 


ccggggacga 


aaacctgatc agggtcttcc aggaggggaa ggacatccac 


1920 


acccagaccg 


caagctggat 


gttcggcgtc cccccggagg ccgtggaccc cctgatgcgc 


1980 


cgggcggcca 


agacggtgaa 


cttcggcgtc ctctacggca tgtccgccca taggctctcc 


2040 


caggagcttg 


ccatccccta 


cgaggaggcg gtggccttta tagagcgcta cttccaaagc 


2100 


ttccccaagg 


tgcgggcctg 


gatagaaaag accctggagg aggggaggaa gcggggctac 


2160 


gtggaaaccc 


tcttcggaag 


aaggcgctac gtgcccgacc tcaacgcccg ggtgaagagc 


2220 


gtcagggagg 


ccgcggagcg 


catggccttc aacatgcccg tccagggcac cgccgccgac 


2280 


ctcatgaagc 


tcgccatggt 


gaagctcttc ccccgcctcc gggagatggg ggcccgcatg 


2340 


ctcctccagg 


tcgccaacga 


gctcctcctg gaggcccccc aagcgcgggc cgaggaggtg 


2400 


gcggctttgg 


ccaaggaggc 


catggagaag gcctatcccc tcgccgtgcc cctggaggtg 


2460 


gaggtgggga 


tgggggagga 


ctggctttcc gccaagggt 


2499 



<210> 586 

<211> 811 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 586 
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Met Glu Phe Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
15 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys Gly Leu 
20 25 30 

Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val lie Val Val 
50 55 60 

Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly Gly Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Gly Asp Arg Asp Leu 
130 135 140 

Tyr Gin Leu Val Ser Asp Arg Val Ala Arg Pro Glu Gin Trp Val Asp 
145 150 155 160 

Tyr Arg Ala Leu Ala Gly Asp Pro Ser Asp Asn lie Pro Gly Val Lys 
165 170 175 

Gly lie Gly Glu Lys Thr Ala Arg Lys Leu Leu Glu Glu Trp Gly Ser 
180 185 190 

Val Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu Lys Pro Ala lie Arg 
195 200 205 

Glu Lys He Leu Ala His Met Glu Asp Leu Lys Leu Ser Leu Glu Leu 
210 215 220 

Ser Arg Val Arg Thr Asp Leu Pro Leu Glu Val Asp Leu Ala Gin Gly 
225 230 235 240 

Arg Glu Pro Asp Arg Glu Gly Leu Lys Ala Phe Leu Glu Arg Leu Glu 
245 250 255 

Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu Glu Ser Pro Val Ala 
260 265 270 

Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly Ala Phe Val Gly Tyr 
275 280 285 

Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu Leu Asn Ala Leu Ala 
290 295 300 
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Ala Ala Trp Gly Gly Arg Val His Arg Ala Ala Asp Pro Leu Ala Gly 
305 310 315 320 

Leu Lys Asp Leu Lys Glu Val Arg Gly Leu Leu Ala Lys Asp Leu Ala 
325 330 335 

Val Leu Ala Ser Arg Glu Gly Leu Asp Leu Val Pro Gly Asp Asp Pro 

340 345 350 

Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn Thr Thr Pro Glu Gly 
355 360 365 

Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu Asp Ala Ala His Arg 
370 375 380 

Ala Leu Leu Ser Glu Arg Leu His Arg Asn Leu Leu Lys Arg Leu Glu 
385 390 395 400 

Gly Glu Glu Lys Leu Leu Trp Leu Tyr His Glu Val Glu Lys Pro Leu 
405 410 415 

Ser Arg Val Leu Ala His Met Glu Ala Thr Gly Val Arg Arg Asp Val 
420 425 430 

Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu Ala Glu Glu He Arg Arg 
435 440 445 

Leu Glu Glu Glu Val Phe Arg Leu Ala Gly His Pro Phe Asn Leu Asn 

450 455 460 

Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp Glu Leu Arg Leu Pro 
465 470 475 480 

Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys Arg Ser Thr Ser Ala Ala 
485 490 495 

Val Leu Glu Ala Leu Arg Glu Ala His Pro He Val Glu Lys He Leu 
500 505 510 

Gin His Arg Glu Leu Thr Lys Leu Lys Asn Thr Tyr Val Asp Pro Leu 
515 520 525 

Pro Ser Leu Val His Pro Arg Thr Gly Arg Leu His Thr Arg Phe Asn 
530 535 540 

Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp Pro Asn Leu 
545 550 555 560 

Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin Arg He Arg Arg Ala 

565 570 575 

Phe Val Ala Glu Ala Gly Trp Ala Leu Val Ala Leu Asp Tyr Ser Gin 
580 585 590 



He Glu Leu Arg Val Leu Ala His Leu Ser Gly Asp Glu Asn Leu He 
595 600 605 
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Arg Val Phe Gin Glu Gly Lys Asp He His Thr Gin Thr Ala Ser Trp 
610 615 620 

Met Phe Gly Val Pro Pro Qlu Ala Val Asp Pro Leu Met Arg Arg Ala 
625 630 635 640 

Ala LYS Thr Val Aan Phe Gly Val Leu Tyr Gly Met ser Ala His Arg 
645 650 655 

Leu ser Gin Glu Leu Ala He Pro Tyr Qlu Glu Ala Val Ala Phe He 
6S0 665 670 

Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg Ala Trp He Glu Lys 
675 680 685 

Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr Val Glu Thr Leu Phe Gly 
690 695 700 

Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala Arg Val Lys Ser Val Arg 
705 710 715 720 

Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro Val Gin Gly Thr Ala 
725 730 735 

Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu Phe Pro Arg Leu Arg 
740 745 750 

Glu Met Gly Ala Arg Met Leu Leu Gin Val Ala Asn Glu Leu Leu Leu 
755 760 765 

Glu Ala pro Gin Ala Arg Ala Glu Glu Val Ala Ala Leu Ala Lys Glu 
770 775 780 

Ala Met Glu Lys Ala Tyr Pro Leu Ala Val Pro Leu Glu Val Glu Val 
785 790 795 800 

Gly Met Gly Glu Asp Trp Leu Ser Ala Lys Gly 
805 810 



<210> 


587 


<2X1> 


2433 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


587 



60 
120 



ctggcctacc gcaccttcca cgccctgaag ggcctcacca ccagccgcgg ggagccggtc 
caggcggtgt acgggtttgc caagagcctt ttgaaggcgc taagagaaga cggggatgtg 180 
gtgatcgtgg tctttgacgc caaggccccc tccttccgcc acgaggccta cggggggtac 240 
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aaaacaaacc 


gqqccccgac 


ccccgaggac 


ttcccccggc 


agctcgccct 


catcaaggag 


300 




tcctggggct 


ggcqcQCctC 

33^3^3*****"* 


qaqqtqccqq 

3**33*'3^**33 


gctttgaagc 


ggatgacgtc 


360 


ctggctaccc 


tggccaagaa 


qqcQQaaaaa 


qaqaact^ acq 
3**yyy^ *-»**y 


aqqtqcqca^ 

**33 **3*-3**»* »* 


tctcaccggc 


420 


aaccQCQacc 


tttaccaact 


cgtctccgac 


cgcgtcgcca 


qaccqqaqca 


qtqqqtqaac 
y ^-yyy ^yy 


480 


tacc999cct 


taaccQQcraa 


cccttccgac 


aacatccccg 


ocatqaaqqq 


cat: caaaQaci 

wawwyyyyay 


540 


aaciacaQCcra 


aaaacrcttct 


Qaaqqaa too 
yy^33^y ^yy 


yyyay wy uyy 


aaaccctcct 


c aaaaac c tcr 

wv»CT*j»vtw w wy 


600 


QaCCQQCtQa 


aacccQCcat: 


ccaaaacraaa 
^^-yyy^y^^-y 


atcctcicicGG 

www W%J*^ W WW 


aGatcrciaaaa 

a wa wyy ay y a 


WW w waay w wa 


660 


tccctQaacic 


******** 3 


CTCQcaccaac 


wwwwwwww*4y 


aaato'CiacGt: 

wfcjjj*j w«j*^aww w 


wy WW wayyyy 


720 


coorcraac c ca 


ac cacraacrcra 




tttttaaaaa 
w w w w wyyay a 


acrefccrcf acf t fc 
yy *■* wyy ay w w 


wyy aay WW w w 


780 


ctccacgagt 


tccracctat fc 


ociri ri fierce CCl 


3 ^yy^y 3^33 


c*My wciy w www 


w w*4y ww*jwww 


840 


^ W w y " Z3 rj ^ ZD 


r* c 1 1 Gcr t cf era 

*^** i» v»w*^ v«*jM>4 




trr* r»rT r* rT" cr 
^wwwywwwwM 


aywwwauywy 


ctcsr*ctct^ etc ^ f" 


900 






yya^as^^y^ 


y uy wctwwyyy 


way way aw WW 


cttyycgggg 


7 o w 


ctaaaaaacc 




V* v*y y y y w w L. ».* 


w w wy w wciciyy 


aw w w wy w wy w 


w w wyy WW w wy 


1020 


aaaaaacTGrcrc 


t acra c c h ccr t 

W ^ J ^ ** >* WV**J w 


Of c c c ocTcrcf a c 

vjViiVawyyyyaw 


cfaCGGGafcac 

y aw w wwn wy w 


tcefceaoGta 
www w wy w w L- a 


GOfcGGfccrcraG 
WW www wyy aw 


1080 


cctfccaaaca. 

W w W ^^^n %A b% wL 




ayyyyi-ay^a 


wy y wy w wawy 


9ygyy9«a*-y 


era r*aasi oexst g 
y awyy ayy aw 


1140 


cfccgcccacc 




c ti caaacraacT 


ctcGatccrcra 

** W V»Wd W W J*jC> 


aGG tGcttaa 

M W W www W WGiG» 


y wy WW wwy ay 


1200 




aQctcctttia 

**y ww*»wwi*4**j 


crctctafr'ap 


y viyy uyyaaa 


ay www w ww w w 


wwyyy www wy 


1260 


QCCcatiatQcr 


aggc c ac c gg 


yy ^«*^yy *-yy 


oacatao'cct 

y ^*i»*3 w *J *H ^ w w 


a G G t* t c acrcf c 

^^N^ w w w ^ " y y ^* 


GGfc ttccct'a 


1320 


y**y'* ^ *-y*'yy 


acfcracra t c CO 

^33 3 ** 


ccQCchcaacr 


oaao'acrc^t Gt 

y ay y «y y www 


tGGCfGt" triCfG 

w w wy w t« i«»yy ** 


rfCrCTGGaGGGG 

yyywwawwww 


1380 


ttcaacctca. 


ac t G c ccracra 


ccaaetaaaa 


acracrtactcfc 

ay y y wy w w w w 


w wyawyay w w 


wayy w w www w 


1440 




GL*J a w ^ Gl GL d A 


y d w Ay y c eicL^ 


wy w wwwaw wa 


y cgccywyy u 


g w i^y y cly y c c 


1 ^00 






t-y cyy ay aay 


a WW w WW way w 


awcyyyaycu 


CaCwaay wtw 


1 

X3 O w 


aagaacacct 


accrtioaa.ccc 

WW*J W*J*JCfcS^^ w 


cctecGaacic 

WW w w wwGLay w 


GtccrtGcacG 

w w wy w w waw w 




wwy WW wwwaw 


1620 


acccgcttca 


accagacggc 


caecraccaccf 

******y y wwci wy 


craof acrcfG 1 1 a 
yyy «yy V- w wa 


*5 1« ^y y ^ 


cGGGaaGGfccr 

wwwwcmww wy 


1680 


cagaacatcc 


ccgtccgcac 


ccccttgggc 


cagaggatcc 


gccgggcctt 


cgtggccgag 


1740 


gcgggttggg 


cgttggtggc 


cctggactat 


agccagatag 


agctccgcgt 


cctcgcccac 


1800 


ctctccgggg 


acgaaaacct 


gatcagggtc 


ttccaggagg 


ggaaggacat 


ccacacccag 


1860 


accgcaagct 


ggatgttcgg 


cgtccccccg 


gaggccgtgg 


accccctgat 


gcgccgggcg 


1920 


gccaagacgg 


tgaacttcgg 


cgtcctctac 


ggcatgtccg 


cccataggct 


ctcccaggag 


1980 
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<210> 588 

<211> 811 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 588 

Met Glu Phe Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
15 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys Gly Leu 
20 25 30 

Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe Ala Lys 
35 40 45 

Ser Leu Leu Lys Ala Leu Arg Glu Asp Gly Asp Val Val lie Val Val 
50 55 60 

Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly Gly Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu He Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu Glu Val 
100 105 110 

Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 
115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Gly Asp Arg Asp Leu 
130 135 140 

Tyr Gin Leu Val Ser Asp Arg Val Ala Arg Pro Glu Gin Trp Val Asp 
145 150 155 160 



cttgccatcc cctacgagga ggcggtggcc tttatagagc gctacttcca aagcttcccc 2040 

aaggtgcggg cctggataga aaagaccctg gaggagggga ggaagcgggg ctacgtggaa 2100 

accctcttcg gaagaaggcg ctacgtgccc gacctcaacg cccgggtgaa gagcgtcagg 2160 

gaggccgcgg agcgcatggc cttcaacatg cccgtccagg gcaccgccgc cgacctcatg 2220 

aagctcgcca tggtgaagct cttcccccgc ctccgggaga tgggggcccg catgctcctc 2280 

caggtcgcca acgagctcct cctggaggcc ccccaagcgc gggccgagga ggtggcggct 2340 

ttggccaagg aggccatgga gaaggcctat cccctcgccg tgcccctgga ggtggaggtg 2400 

gggatggggg aggactggct ttccgccaag ggt 2433 



641 



wo 01/90337 



PCTAJSOl/17086 



Tyr Arg Ala Leu Ala Gly Asp Pro Ser Asp Asn lie Pro Gly Val Lys 
165 170 175 

Gly lie Gly Glu Lys Thr Ala Arg Lys Leu Leu Glu Glu Trp Gly Ser 
180 185 190 

Val Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu Lys Pro Ala lie Arg 

195 200 205 

Glu Lys lie Leu Ala His Met Glu Asp Leu Lys Leu Ser Leu Glu Leu 
210 215 220 

Ser Arg Val Arg Thr Asp Leu Pro Leu Glu Val Asp Leu Ala Gin Gly 
225 230 235 240 

Arg Glu Pro Asp Arg Glu Gly Leu Lys Ala Phe Leu Glu Arg Leu Glu 
245 250 255 

Phe Gly Ser Leu Leu His Glu Phe Gly Leu Leu Glu Ser Pro Val Ala 
260 265 270 

Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly Ala Phe Val Gly Tyr 
275 280 285 

Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu Leu Asn Ala Leu Ala 
290 295 300 

Ala Ala Trp Gly Gly Arg Val His Arg Ala Ala Asp Pro Leu Ala Gly 
305 310 315 320 

Leu Lys Asp Leu Lys Glu Val Arg Gly Leu Leu Ala Lys Asp Leu Ala 
325 330 335 

Val Leu Ala Ser Arg Glu Gly Leu Asp Leu Val Pro Gly Asp Asp Pro 
340 345 350 

Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn Thr Thr Pro Glu Gly 
355 360 365 

Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu Asp Ala Ala His Arg 
370 375 380 

Ala Leu Leu Ser Glu Arg Leu His Arg Asn Leu Leu Lys Arg Leu Glu 
385 390 395 400 

Gly Glu Glu Lys Leu Leu Trp Leu Tyr His Glu Val Glu Lys Pro Leu 
405 410 415 

Ser Arg Val Leu Ala His Met Glu Ala Thr Gly Val Arg Arg Asp Val 
420 425 430 

Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu Ala Glu Glu lie Arg Arg 
435 440 445 

Leu Glu Glu Glu Val Phe Arg Leu Ala Gly His Pro Phe Asn Leu Asn 
450 455 460 
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Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp Glu Leu Arg Leu Pro 
465 470 475 480 

Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys Arg Ser Thr Ser Ala Ala 
485 490 495 

Val Leu Glu Ala Leu Arg Glu Ala His Pro lie Val Glu Lys He Leu 
500 505 510 

Gin His Arg Glu Leu Thr Lys Leu Lys Asn Thr Tyr Val Asp Pro Leu 
515 520 525 

Pro Ser Leu Val His Pro Arg Thr Gly Arg Leu His Thr Arg Phe Asn 
530 535 540 

Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp Pro Asn Leu 
545 550 555 560 

Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin Arg He Arg Arg Ala 
565 570 575 

Phe Val Ala Glu Ala Gly Trp Ala Leu Val Ala Leu Asp Tyr Ser Gin 
580 585 590 

He Glu Leu Arg Val Leu Ala His Leu Ser Gly Asp Glu Asn Leu He 
595 600 605 

Arg Val Phe Gin Glu Gly Lys Asp He His Thr Gin Thr Ala Ser Trp 
610 615 620 

Met Phe Gly Val Pro Pro Glu Ala Val Asp Pro Leu Met Arg Arg Ala 
625 630 635 640 

Ala Lys Thr Val Asn Phe Gly Val Leu Tyr Gly Met Ser Ala His Arg 
645 650 655 

Leu Ser Gin Glu Leu Ala He Pro Tyr Glu Glu Ala Val Ala Phe He 
660 665 670 

Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg Ala Trp He Glu Lys 
675 680 685 

Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr Val Glu Thr Leu Phe Gly 
690 695 700 

Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala Arg Val Lys Ser Val Arg 
705 710 715 720 

Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro Val Gin Gly Thr Ala 
725 730 735 

Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu Phe Pro Arg Leu Arg 
740 745 750 



Glu Met Gly Ala Arg Met Leu Leu Gin Val Ala Asn Glu Leu Leu Leu 
755 760 765 
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Glu Ala Pro Gin Ala Arg Ala Glu Glu Val Ala Ala Leu Ala Lys Glu 
770 775 780 

Ala Met Glu Lys Ala Tyr Pro Leu Ala Val Pro Leu Glu Val Glu Val 
785 790 795 800 

Gly Met Gly Glu Asp Trp Leu Ser Ala Lys Gly 
805 810 

<210> 589 
<211> 2493 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 



<400> 589 



atgaattccc 


tgcccctctt 


tgagcccaag 


ggccgggtcc 


tcctggtgga 


cggccaccac 


60 


ctggcctacc 


gtaccttttt 


tgccctgaag 


ggcctcacca 


cctcccgggg 


ggagccggtg 


120 


cagatggtct 


acggcttcgc 


ccggagcctc 


ctcaaggccc 


tcaaggagga 


C999?e^c:gcg 


180 


gtgatcgtgg 


tctttgacgc 


cgaggccccc 


tccttccgcc 


accagaccta 


cgaggcctac 


240 


aaggcgggga 


gggctcccac 


ccccgaggac 


tttccccggc 


agcttgccct 


tatcaaggag 


300 


ctggtggacc 


tcctggggtt 


tacccgcctc 


gaggtccccg 


gctacgaggc 


ggacgacgtt 


360 


ctcgccaccc 


tggccaagaa 


ggcggaaaag 


gagggctacg 


aggtgcgcat 


cctcaccgcg 


420 


gaccgggacc 


tttaccagct 


tctttccgac 


cgcattcacg 


tccttcaccc 


cgaggggtac 


480 


ctcatcaccc 


cggcctggct 


ttgggaaaag 


tacggcttga 


ggcccgacca 


gtgggccgac 


540 


taccgggccc 


tgaccgggga 


cgaatccgac 


aacctttccg 


gggtcaaggg 


catcggggag 


600 


aagacggcga 


ggaagcttct 


ggaggagtgg 


gggagcctgg 


aagccctcct 


caagaacctg 


660 


gaccggctga 


agcccgccat 


ccgggagaag 


atcctggccc 


acatggacga 


tctgaagctc 


720 


tccttggagc 


tctcccgggt 


gcgcaccgac 


ctccccctgg 


aggtggactt 


cgccaaaagg 


780 


cgggagcccg 


accgggagag 


gcttagggcc 


tttctggaga 


ggcttgagtt 


tggcagcctc 


840 


ctccacgagt 


tcggcccttt 


ggaaagcccc 


agggccgcgg 


aggaagctcc 


ctggccgccc 


900 


cccgagggag 


ccttcgtggg 


gtacgttctt 


tcccgccccg 


agcccatgtg 


ggcggagctt 


960 


aacgccttgg 


ccgccgccag 


gggcggccgc 


gtgcaccggg 


cagcagaccc 


cttggcgggg 


1020 


ctaaaggacc 


tcaaggaggt 


ccggggcctc 


ctcgccaagg 


acctcgccgt 


cttggcctcg 


1080 


agggaggggc 


tagacctcgt 


gcccggggac 


gaccccatgc 


tcctcgccta 


cctcctggac 


1140 
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ccttcgaaca ccacccccga gggggtggcg cggcgctacg ggggggagtg gacggaggac 1200 

gccgcccacc gggccctcct ctcggagagg ctccatcgga acctccttaa gcgcctcgag 1260 

ggggaggaga agctcctttg gctctaccac gaggtggaaa agcccctctc ccgggtcctg 1320 

gcccatatgg aggccaccgg ggtacggcgg gacgtggcct accttcaggc cctttccctg 1380 

gagcttgcgg aggagatccg ccgcctcgag gaggaggtct tccgcttggc gggccacccc 1440 

ttcaacctca actcccggga ccagctggaa agggtgctct ttgacgagct taggcttccc 1500 

gccttgaaga agacgaagaa gacaggcaag cgctccacca gcgccgcggt gctggaggcc 1560 

ctacgggagg cccaccccat cgtggagaag atcctccagc accgggagct caccaagctc 1620 

aagaacacct acgtggaccc cctcccaagc ctcgtccacc cgaggacggg ccgcctccac 1680 

acccgcttca accagacggc cacggccacg gggaggctta gtagctccga ccccaacctg 1740 

cagaacatcc ccgtccgcac ccccttgggc cagaggatcc gccgggcctt cgtggccgag 1800 

gcgggttggg cgttggtggc cctggactat agccagatag agctccgcgt cctcgcccac 1860 

ctctccgggg acgaaaacct gatcagggtc ttccaggagg ggaaggacat ccacacccag 1920 

accgcaagct ggatgttcgg cgtccccccg gaggccgtgg accccctgat gcgccgggcg 1980 

gccaagacgg tgaacttcgg cgtcctctac ggcatgtccg cccataggct ctcccaggag 2040 

cttgccatcc cctacgagga ggcggtggcc tttatagagc gctacttcca aagcttcccc 2100 

aaggtgcggg cctggataga aaagaccctg gaggagggga ggaagcgggg ctacgtggaa 2160 

accctcttcg gaagaaggcg ctacgtgccc gacctcaacg cccgggtgaa gagcgtcagg 2220 

gaggccgcgg agcgcatggc cttcaacatg cccgtccagg gcaccgccgc cgacctcatg 2280 

aagctcgcca tggtgaagct cttcccccgc ctccgggaga tgggggcccg catgctcctc 2340 

caggtcgcca acgagctcct cctggaggcc ccccaagcgc gggccgagga ggtggcggct 2400 

ttggccaagg aggccatgga gaaggcctat cccctcgccg tgcccctgga ggtggaggtg 2460 

gggatggggg aggactggct ttccgccaag ggt 2493 



<210> 590 

<211> 831 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 590 
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Met Asn Ser Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu Val 
1.5 10 15 

Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu Lys Gly Leu 
20 25 30 

Thr Thr Ser Arg Gly Glu Pro Val Gin Met Val Tyr Gly Phe Ala Arg 
35 40 45 

Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie Val Val 
50 55 60 

Phe Asp Ala Glu Ala Pro Ser Phe Arg His Gin Thr Tyr Glu Ala Tyr 
65 70 75 80 

Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin Leu Ala 
85 90 95 

Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu Glu Val 
100 105 110 

Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu Ala Lys Lys Ala 

115 120 125 

Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Arg Asp Leu 
130 135 140 

Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu Gly Tyr 
145 150 155 160 

Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg Pro Asp 
165 170 175 

Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp Asn Leu 
180 185 190 

Ser Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu Leu Glu 
195 200 205 

Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg Leu Lys 
210 215 220 

Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu Lys Leu 
225 230 235 240 

Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu Pro Leu Glu Val Asp 
245 250 255 

Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala Phe Leu 
260 265 270 

Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Pro Leu Glu 
275 280 285 



Ser Pro Arg Ala Ala Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly Ala 
290 295 300 
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Phe Val Gly Tyr Val Leu Ser Arg Pro Glu Pro Met Trp Ala Glu Leu 
305 310 315 320 

Asn Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala Ala Asp 
325 330 335 

Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val Arg Gly Leu Leu Ala 
340 345 350 

Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly Leu Asp Leu Val Pro 
355 360 365 

Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn Thr 
370 375 380 

Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu Asp 
385 390 395 400 

Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu His Arg Asn Leu Leu 
405 410 415 

Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp Leu Tyr His Glu Val 
420 425 430 

Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr Gly Val 
435 440 445 

Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser Leu Glu Leu Ala Glu 
450 455 460 

Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg Leu Ala Gly His Pro 
465 470 475 480 

Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp Glu 
485 490 495 

Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys Thr Gly Lys Arg Ser 
500 505 510 

Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro He Val 
515 520 525 

Glu Lys He Leu Gin His Arg Glu Leu Thr Lys Leu Lys Asn Thr Tyr 
530 535 540 

Val Asp Pro Leu Pro Ser Leu Val His Pro Arg Thr Gly Arg Leu His 
545 550 555 560 

Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser 
565 570 575 

Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin Arg 
580 585 , 590 

He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp Ala Leu Val Ala Leu 
595 600 605 
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Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala His Leu Ser Qly Asp 
610 615 620 

Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys Asp lie His Thr Gin 
625 630 635 640 

Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp Pro Leu 
645 650 655 

Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr Gly Met 
660 665 670 

Ser Ala His Arg Leu Ser Gin Glu Leu Ala He Pro Tyr Glu Glu Ala 

675 680 685 

Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg Ala 
690 695 700 

Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys Arg Gly Tyr Val Glu 
705 710 715 720 

Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Asn Ala Arg Val 
725 730 735 

Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro Val 
740 745 750 

Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys Leu Phe 
755 760 765 

Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu Leu Gin Val Ala Asn 
770 775 780 

Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala Glu Glu Val Ala Ala 
785 790 795 800 

Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro Leu Ala Val Pro Leu 
805 810 815 

Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu Ser Ala Lys Gly 
820 825 830 

<210> 591 
<211> 23 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 591 

agaaaggaag ggaagaaagc gaa 23 



.<210> 592 
<211> 45 
<212> DNA 
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<213> Artificial 
<220> 

<223> Synthetic 
<400> 592 

tggaggtcaa aacatcgata agtcgaagaa aggaagggaa gaaat 45 



<210> 593 

<211> 14 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 593 
tgttttgacc tcca 



<210> 594 

<211> 20 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 594 

ggcgaccaca cccgtcctgt 20 



<210> 595 

<211> 20 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 



<400> 595 

ccacgatgcg tccggcgtag 20 



<210> 596 

<211> 26 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 



<400> 596 

acgggtcaat gtccatgccc caaaga 26 
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<210> 597 

<211> 41 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<220> 

<221> modif ied_base 

<222> (23).. (23) 

<223> /note= "The residue at this position indicates 2' o-tnethyl sugar" 

<400> 597 

gtctgagatg aaagtgctcc cgcacccacc caaggcacag c 41 



<210> 598 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 598 

tcttcgcaca tttcatctca gacgga 26 



<210> 599 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<220> 

<22l> modif ied_base 

<222> (1)..(17) 

<223> /note= «The residues at these positions are 2' o-methyl sugars" 



<400> 599 

gctgtgcctt gggtggg 17 



<210> 600 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
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<220> 

<221> modif ied_base 

<222> (1)..(21) 

<223> /notes "The residues at these positions are 2' o-methyls" 



<400> 600 

gctgtgcctt gggtgggtgc g 21 



<210> 601 

<211> 28 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 601 

aacgaggcgc acccacccaa ggcacagc 28 



<210> 602 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<220> 

<221> modif ied_base 

<222> (23) . . (27) 

<223> /notes "The residues at these positions are 2' o-methyls" 



<400> 602 

gtctgalgatg aaagtgcgcc tcgttaa 27 



<210> 603 

<211> 28 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 603 

aacgaggcgc acccacccaa ggcacagc 28 



<210> 604 

<211> 23 

<212> DNA 

<213> Artificial 
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<220> 

<223> Synthetic 
<400> 604 

tcttcgcaca tttcatctca gac 23 



<210> 605 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 605 

gctgtgcctt gggtgggtgc g 21 



<210> 606 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 606 

tcttcgcaca tttcatctca gacgga 26 



<210> 607 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 607 

gctgtgcctt gggtgggtgc 20 



<210> 608 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<220> 

<22l> modif ied_base 

<222> (15).. (20) 

<223> /note* "The residues at these positions are 2* o-tnethyl sugars" 
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<400> 606 

gctgtgcctt gggtgggtgc 20 



<210> 


609 


<211> 


20 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<220> 




<221> 


modified base 


<222> 


(1) . . (20) 


<223> 


/note= "The residues 


<400> 


609 


gctgtgcctt gggtgggtgc 


<210> 


610 


<211> 


26 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


610 



20 



acgggtcaat gtccatgccc caaaga 26 

<210> 611 

<211> 28 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



28 



<400> 


611 


aacgaggcgc acccacccaa 


<210> 


612 


<211> 


27 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<220> 




<221> 


modified base 


<222> 


(23) . . (27) 
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<223> /iiote= "The residues at these positions are 2' o-tnethyls" 



<400> 612 

gtctgagatg aaagtgcgcc tcgttaa 27 



<210> 613 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<220> 

<221> modif ied_base 

<222> (1)..(17) 

<223> /note= "The residues at these positions are 2' o-methyls" 



<400> 613 

gctgtgcctt gggtggg 17 



<210> 


614 


<211> 


19 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<220> 




<221> 


modified base 


<222> 


(1) . . (19) 


<223> 


/note= "The res 


<400> 


614 


gctgtgcctt gggtgggtg 


<210> 


615 


<211> 


21 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<220> 




<221> 


modified base 


<222> 


(1) (21) 



19 



<223> /notes "The residues at these positions are 2* o-methyls" 
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<400> 615 

gctgtgcctt gggtgggtgc g 21 

<210> 616 

<211> 22 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<220> 

<221> modified_base 

<222> (1)..(22) 

<223> /notes "The residues at these positions are 2* o-tnethyls" 

<400> 616 

gctgtgcctt gggtgggtgc gc 22 

<210> 617 

<211> 26 

<212> DSTA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 617 

ccgtcaacat ttaccatggg tgcgga 26 



<210> 618 

<211> 31 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 618 

ccgccacctc gtagtcgaca tccttttcgt g 

<210> 619 

<211> 29 

<212> DNA 

<213> Artificial 



31 



<220> 

<223> Synthetic 
<400> 619 

ttgataaatt tggggtggaa aggtttgga 29 



655 



wo 01/90337 



PCTAJSOl/17086 



<210> 620 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 620 

agaactgaga ggaggcg 17 



<210> 621 

<211> 25 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 621 

aacgaggcgc accaaactca ctcat 25 



<210> 622 

<211> 35 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 622 

gtcatgtagg cttctatgta gttgatgaag atgta 35 



<210> 623 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 623 

ggctttgtag atgcctttct cttgga 26 



<210> 624 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
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<400> 624 

atgagtgagt ttggtgcg 18 



<210> 625 

<211> 27 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 625 

ccaggaagca agtggtgcgc ctcgttt 27 



<210> 626 

<211> 22 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 626 

aacgaggcgc accttggagg ca 22 



<210> 627 

<211> 24 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 627 

aaggtttcct tctcagttgt gtta 24 



<210> 628 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 628 

gcaaagatgt ctgttacggt caactc 26 



<210> 629 

<211> 16 

<212> DNA 

<213> Artificial 
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<220> 

<223> Synthetic 
<400> 629 

tgcctccaag gtgcgc 16 



<210> 630 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 630 

aacgaggcgc accttcaaaa tgcctaa 27 



<210> 631 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 631 

tgtcactctc ctctttccaa tta 23 



<210> 632 

<211> 29 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 632 

gaaaagagtt ccattatccg ctacatctg 29 



<210> 633 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 633 

ttaggcattt tgaaggtgcg c 21 



<210> 634 
<211> 23 
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<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 634 

aacgaggcgc accgttgtgt ccc 

<210> 635 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 635 

gggatgtaga agccattcag a 

<210> 636 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 636 

ttgttgtgct gtgggggatg 

<210> 637 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 637 
gggacacaac ggtgcgc 

<210> 638 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 638 

ccgtcacgcc tccaccatat ccc 
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<210> 639 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 639 

ccagcggttt ccattggcaa agatcaa 27 



<210> 640 

<211> 21 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 640 

cggaagaatg ggtcgaccat g 21 



<210> 641 

<211> 17 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 641 

gggatatggt ggaggcg 17 



<210> 642 

<211> 22 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 642 

aacgaggcgc accgttccag gc 22 



<210> 643 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
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<400> 643 

catatccatg cagcaccacc atga 24 



<210> 644 

<211> 25 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 644 

caaaatacag agtgaacaca gggcc 25 



<210> 645 

<211> 16 

<212> DNA 

<213> Artificial 

<220> 

<223> synthetic 

<400> 645 

gcctggaacg gtgcgc 16 



<210> 646 

<211> 25 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 646 

ccgtcacgcc tcatggataa tgccc 25 



<210> 647 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 647 

caggtgagaa aaggcattac agatagtgaa age 33 

<210> 648 

<211> 22 

<212> DNA 

<213> Artificial 
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<220> 

<223> Synthetic 
<400> 648 

cagaggaaag agagctgcag gg 



<210> 649 

<211> 19 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 649 

gggcattatc catgaggcg 



<210> 650 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 650 

ccgtcacgcc tccctgctga gaaa 24 



<210> 651 

<211> 20 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 651 

cccgaggcat gcacggcgga 20 



<210> 652 

<211> 15 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 652 

ggcaggaagg cctcc 15 



• <210> 653 
<211> 18 
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<220> 

<223> Synthetic 

<400> 653 

tttctcagca gggaggcg 18 



<210> 654 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 654 

ccgtcacgcc tcgccccaca 20 



<210> 655 

<211> 28 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 655 

cagcacaggc tgttgaccat cataaaac 28 



<210> 656 

<211> 25 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 656 

cttttccata ctttttatga cattc 25 



<210> 657 

<211> 14 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 657 

tgtggggcga ggcg 14 
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<210> 658 

<211> 23 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 658 

aacgaggcgc acagttgacc ttc 23 



<210> 659 

<211> 19 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 



<400> 659 

gtgatggcca gcacagggc 19 



<210> 660 

<211> 20 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 660 

atacgttccc cacatttttc 20 



<210> 661 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 661 

tgaaggtcaa ctgtgcgc 18 



<210> 662 

<211> 26 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
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<400> 662 

aacgaggcgc acgtcataaa tacccc 26 



<210> 663 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 663 

gccagcatag gctgttgaca c 21 



<210> 664 

<211> 28 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 664 

agacttttct atacttttta taacattc 28 



<210> 665 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 665 

ggggtattta tgacgtgcgc 20 



<210> 666 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 666 

ccgtcacgcc tcctgtctgt gat 23 



<210> 667 

<211> 22 

<212> DNA 

<213> Artificial 
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<220> 

<223> Synthetic 
<400> 667 

tcctgacagt gctcaatcag ga 22 



<210> 668 

<211> 22 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 668 

tcctgacaat gctcaatgag ga 22 



<210> 669 

<211> 16 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 669 

gtcccggatg tggccc 16 



<210> 670 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



<400> 670 
atcacagaca ggaggcg 



17 



<210> 671 

<211> 26 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 671 

aacgaggcgc acggactgtt ttctgc 26 



<210> 672 
<211> 27 
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<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 672 

cttgtcaaag tcctgatagt gctcctc 

<210> 673 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 673 

cttgttgaag tcttgatagt gttcctc 

<210> 674 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 674 

gcagaaaaca gtccgtgcgc 

<210> 675 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 675 

ccgtcacgcc tcactgcggt cat 



<210> 676 

<211> 31 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 676 

gtggataact gcatcagtgt atggcatttt c 



31 
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<210> 677 

<211> 25 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



<400> 677 

caagggttgg tagcctgtgt gagcc 



25 



<210> 678 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



<400> 678 
atgaccgcag tgaggcg 



17 



<210> 679 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



<400> 679 

ccgtcacgcc tcagagccaa tcac 



24 



<210> 680 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



<400> 680 

cgatcatcaa gggatggtgg cctgtgc 



27 



<210> 681 

<211> 31 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
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<400> 681 

ctgatcaatc tccttttgga ctttctctgc g 31 



<210> 682 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 682 

gtgattggct ctgaggcg 18 



<210> 683 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 683 

ccgtcacgcc tcctcttcaa tttctg 26 



<210> 684 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 684 

ccctgtcaat ttcttcatga agttta 26 



<210> 685 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 685 

ggtatttcat gaggatcagg age 23 



<210> 686 

<211> 20 

<212> DNA 

<213> Artificial 
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<220> 

<223> Synthetic 
<400> 686 

cagaaattga agaggaggcg 20 



<210> 687 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 687 

aacgaggcgc accgggtccc a 21 



<210> 688 

<211> 30 

<212> DWA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 688 

tcccctgttt cttgaaaagt ccatgtgtga 30 



<210> 689 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 689 

aatccgtaga ggagcaccag g 21 



<210> 690 

<211> 15 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 690 

tgggacccgg tgcgc 15 



<210> 691 
<211> 21 



670 



wo 01/90337 



PCTAJSOl/17086 



<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 691 

ccgtcacgcc tcctcggcag g 



<210> 692 

<211> 29 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 692 

cacaatatcg taggtaggag gtgccttaa 



<210> 693 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 693 
gccccatcga tctcctcc 



<210> 694 

<211> 15 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 694 
cctgccgagg aggcg 



<210> 695 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 695 

aacgaggcgc actaggcttt get 
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<210> 696 

<211> 31 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 



<400> 696 

ttcatgtagt cagggtcata gacaattaag a 31 



<210> 697 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 697 

tccccagaac catcgaggaa agg 



<210> 698 

<:211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 698 
agcaaagcct agtgcgc 



<210> 699 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 699 

aacgaggcgc acagaaggcc cctt 



<210> 700 

<211> 31 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
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<400> 700 

ccttgaacag caccagaaat agactgagca c 31 



<210> 701 

<211> 23 

<:212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 701 

ggaagaaccc agagacacca tec 23 



<210> 702 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 702 

aaggggcctt ctgtgcgc 18 



<210> 703 

<211> 25 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 703 

aacgaggcgc acgttgtgat acctt 25 



<210> 704 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 704 

gatgaaggcc ataaattaaa attgtgc 27 



<210> 705 

<211> 16 

<212> DNA 

<213> Artificial 
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<220> 

<223> Synthetic 

<400> 705 
tgggtatgga acgtcc 



<210> 706 
<211> 19 
<212> DNA 
<213> . Artificial 



<220> 

<223> Synthetic 



<400> 706 

aaggtatcac aacgtgcgc 19 



<210> 707 

<211> 14 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 707 

cccccttttg gggg 14 



<210> 708 

<211> 30 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 708 

ccctatcttt aaagttttta aaaagtttga 30 
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I* w v*^ L> ^ w A 


occuycayaa cauwcccyuc cywowCwwwb ugggccagag yduccgccgg 


1 flnn 


^ ^ 4* in ftfH 


ccgaggcggg ucgggcgucg gcggccccgg accatagcca gacagagccc 






cccacctc^c cgggg^cg^^ aacctgatca gggticttcca gg^iggggaag 




y cL^al.^Ca.^a 


cccagaccgc aagcuggacg cccggcgtcc ccccggaggc cgcggacccc 


1 Q o n 


^s-y otycgcc 


ygycyyccao. gacggcgaac uccyycg^cc ccuacggcau guccycccac 


o >i n 


^yycuctccc 


*yy*9c^^9c caccccccac gaggaggcgg cggcccccac agagcgccac 




U U^LrCLdcty C« U 


cccccaayyc ycgggccuyy auagaaaaga cccuggagga ggggaggaag 


^ J.OU 


CyyyyCbacy 


tggaaaccct^ ctbcggaaga aggcgctacg tgcccgacct caacgcccgg 




ycy aayaycy 


tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 


O O O A 


gccgccgacc 


tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 


2340 


gcccgcatgc 


tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 


2400 


gaggaggtgg 


cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 


2460 


ctggaggtgg 


aggtggggat gggggaggac tggchttccg ccaagggtca ccaccaccac 


2520 


caccac 




2526 



<210> 376 

<2H> 842 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 376 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 - 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 
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Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Lys Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Qlu Leu Ser Arg Val Arg Thr Asp Leu 

245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 

355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 
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Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr' 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 
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Pro Lys Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 



<210> 


377 


<211> 


36 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


377 



gagaggctcc atcggaagaa gcttaagcgc ctcgag 36 



<210> 378 

<211> 18 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic 
<400> 378 

ccgatggagc ctctccga 18 



<210> 379 

<211> 2526 

<212> DNA 

<213> Artificial 
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<220> 

<223> Synthetic 
<400> 379 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 

ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120 

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 

gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240 

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 300 

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360 

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 420 

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 480 

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 540 

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 600 

ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660 

ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 720 

gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 780 

gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 840 

gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 900 

gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960 

atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtgca ccgggcagca 1020 

gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080 

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140 

gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 1200 

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaagaag 1260 

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 1320 

ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 1380 

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500 

gagcttaggc ttcccgcctt gaagaagacg aagaagacag gcaagcgctc caccagcgcc 1560 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620 
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gagctcacca 


agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 


1680 


acgggccgcc 


tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 


1740 


tccgacccca 


acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 


1800 


gccttcgtgg 


ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 


1860 


cgcgtcctcg 


cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 


1920 


gacatccaca 


cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 


1980 


ctgatgcgcc 


gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 


2040 


aggctctccc 


aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 


2100 


ttccaaagct 


tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 


2160 


cggggctacg 


tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 


2220 


gtgaagagcg 


tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 


2280 


gccgccgacc 


tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 


2340 


gcccgcatgc 


tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 


2400 


gaggaggtgg 


cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 


2460 


ctggaggtgg 


aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 


2520 


caccac 




2526 



<210> 380 

<211> 842 

.<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 380 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
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65 



70 



75 



80 



Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 

180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 

290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 



Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
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370 



375 



380 



Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Lys Lys Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 

515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val* His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 



Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
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675 680 685 

Pro Tyx Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro hys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 

740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
.805 ' 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 

<210> 381 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 381 

gtgcgcaccg acctcctcct ggaggtggac etc 33 



<210> 


382 


<211> 


33 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


382 



gaggtccacc tccaggagga ggtcggtgcg cac 33 



<210> 383 
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<211> 2526 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 383 



atgaattccg 


aggcgatgct 


tccgctcttt gaacccaaag gccgggtcct cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc gccctgaagg gcctcaccac gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac gccaaggccc cctccttccg ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccGcg acccccgagg acttcccccg gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg tttacccgcc tcgaggtccc cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg ctttgggaga agtacggcct caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg gacccctccg acaacctccc cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaa4ctc ctcaaggagt ggggaagcct ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa aacgtccggg agaagatcaa ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc cgggtgcgca ccgacctcct cctggaggtg 


780 


gacctcgccc 


aqqqqCQQqa 


qcccqaccqq qaqqqqctta qqgccttCCt qqaqaqqctg 


840 


aaattcQQca 


gcctcctcca 


cgagttcggc ctcctggagg cccccgcccc cctggaggag 


900 


gccccctggc 


ccccciccQqa 


aggggccttc gtgggcttcg tcctctcccg ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc tgcaggggcg gccgcgtgca ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag gaggtccggg gcctcctcgc caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac ctcgtgcccg gggacgaccc catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc ctttggctct accacgaggt ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc accggggtac ggcgggacgt ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc cgggaccagc tggaaagggt gctctttgac 


1500 
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gagcttaggc ttcccgcctt gaagaagacg aagaagacag gcaagcgctc caccagcgcc 1560 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 2520 

caccac 2526 



<210> 384 

<211> 842 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<4'00> 384 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 

15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 
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Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys ;Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Glzi Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asa Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 . 250 255 

Leu Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 

275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 



His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 
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Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 



Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 
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Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly T^g Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
80S 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 



<210> 


385 


<211> 


33 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


385 


ggggccttcg tgggctacgt 


<2ia> 


386 


<211> 


33 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


386 



33 
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ggggcgggag aggacgtagc 


ccacgaaggc 


ccc 




33 


<210> 387 
<211> 2526 
<212> DNA 
<213> Artificial 










<220> 

<223> Synthetic 










<400> 387 
atgaattccg 


^99C9atgct 


tccgctcttt 


gaacccaaag gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg acaacctccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggctacg tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg gccgcgtgca 


ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac ggcgggacgt 


ggcctacctt 


1380 
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caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500 

gagcttaggc ttcccgcctt gaagaagacg aagaagacag gcaagcgctc caccagcgcc 1560 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 

cgcgtcGtcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 2520 

caccac 2526 



<210> 388 

<211> 842 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 388 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 
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Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala hya Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Tyr Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 



Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 
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His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Qlu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
5-95 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 



His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 
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Asp He His Thr Gin Thr Ala Ser Trp Met Phe Qly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu hrg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 



<210> 


389 


<211> 


42 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


389 



gaaaacctcc tcaagcacct ggaccaggta aagccagaaa ac 42 



<210> 390 

<211> 42 

<212> DNA 

<213> Artificial 
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<220> 

<223> Synthetic 



<400> 390 

gttttctggc 


tttacctggt 


ccaggtgctt 


gaggaggttt 


tc 




42 


<210> 391 
<211> 2526 
<212> DNA 
<213> Artificial 












<220> 

<223> Synthetic 












<400> 391 
atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctTccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag . 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagcacc 


tggaccaggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgtgca 


ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacGCttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 
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<210> 392 

<211> 842 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 392 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 



cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 1320 

ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 1380 

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500 

gagcttaggc ttcccgcctt gaagaagacg aagaagacag gcaagcgctc caccagcgcc 1560 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 2520 
caccac 



2526 



695 



wo 01/90337 



PCT/USOl/17086 



10 



15 



Leu Leu Val Asp Qly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys His Leu 
210 215 220 

Asp Gin Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265' 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 



Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
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305 310 315 320 

Met Trp Ala Qlu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lye Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
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610 



615 



620 



His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Qly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 



<210> 393 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 393 

gagctctccc gggtgcacac cgacctcccc ctg 33 

<210> 394 



835 



840 
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<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 394 

cagggggagg tcggtgtgca cccgggagag etc 



33 



<210> 395 
<211> 2526 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 395 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 

ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120 

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 

gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240 

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 300 

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360 

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 420 

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 480 

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 540 

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 600 

ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660 

ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 720 

gaagacctca ggctctcctt ggagctctcc cgggtgcaca ccgacctccc cctggaggtg 780 

gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 840 

gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 900 

gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960 

atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtgca ccgggcagca 1020 

gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080 

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140 
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gcctacctcc 


tggacccttc 


gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


cdaccgggcc ctcctctcgg agaggctcca tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc ctttggctct accacgaggt ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc accggggtac ggcgggacgt ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc cgggaccagc tggaaagggt gctctttgac 


1500 


gagcttaggc 


ttcccgcctt 


gaagaagacg aagaagacag gcaagcgctc caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac cccatcgtgg agaagatoct ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg gaccccctcc caagcctcgt ccacccgagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag acggccacgg ccacggggag gcttagtagc 


1740 


tccgacccca 


acctgcagaa 


catccccgtc cgcaccccct tgggccagag gatccgccgg 


1800 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg gtggccctgg actatagcca gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


cggggacgaa aacctgatca gggtcttcca ggaggggaag 


1920 


gacatccaca 


cccagaccgc 


aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 


1980 


ctgatgcgcc 


9ggc99ccaa 


gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 


2040 


aggctctccc 


aggagcttgc 


catcccctac gaggaggcgg tggcctttat agagcgctac 


2100 


ttccaaagct 


tccccaaggt 


gcgggcctgg atagaaaaga ccctggagga ggggaggaag 


2160 


cggggctacg 


tggaaaccct 


cttcggaaga aggcgctacg tgcccgacct caacgcccgg 


2220 


gtgaagagcg 


tcagggaggc 


cgcggagcgc atggccttca acatgcccgt ccagggcacc 


2280 


gccgccgacc 


tcatgaagct 


cgccatggtg aagctcttcc cccgcctccg ggagatgggg 


2340 


gcccgcatgc 


tcctccaggt 


cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 


2400 


gaggaggtgg 


cggctttggc 


caaggaggcc atggagaagg cctatcccct cgccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


ggsagaggac tggctttccg ccaagggtca ccaccaccac 


2520 


caccac 






2526 



<210> 396 

<211> 842 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
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<400> 396 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
ISO 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val His Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 
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Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 

290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 . 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 

515 520 525 

Ala His Pro lie Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 



Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 
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Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala I.eu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp lie His Tbr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

ser Ala Lys Gly His His His His His His 

840 





835 


<210> 


397 


<211> 


33 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


397 
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gagctctccc gggtgcacac cgacctcctc ctg 



<210> 398 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 398 

caggaggagg tcggtgtgca cccgggagag etc 



33 



33 



<210> 399 
<211> 2526 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 

<400> 399 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 

ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120 

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 

gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240 

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 300 

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360 

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 420 

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 4 80 

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 540 

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 600 

ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660 

ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 720 

gaagacctca ggctctcctt ggagctctcc cgggtgcaca ccgacctcct cctggaggtg 780 

gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 840 

gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 900 

gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960 

atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtgca ccgggcagca 1020 
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gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080 

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140 

gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 1200 

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 1260 

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 1320 

ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 1380 

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500 

gagcttaggc ttcccgcctt gaagaagacg aagaagacag gcaagcgctc caccagcgcc 1560 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 2520 

2526 

caccac 



<210> 400 
<211> 842 
<212> PRT 
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<213> Artificial 
<220> 

<223> Synthetic 
<400> 400 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 

115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 

225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val His Thr Asp Leu 
245 250 255 

Leu Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 
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Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Qly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 



Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 
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Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 

<210> 401 

<211> 2508 

<212> DNA 

<213> Artificial 
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<220> 

<223> Synthetic 
<400> 401 



atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgtgca 


ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


1060 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacGcttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


gtttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


1360 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


gagcttaggc 


ttcccgcctt 


gaagaagacg 


aagaagacag 


gcaagcgctc 


caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 
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gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt ccacccgagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag gcttagtagc 


1740 


tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


tgggccagag gatccgccgg 


1800 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg 


gtggccctgg 


actatagcca gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca 


gggtcttcca ggaggggaag 


1920 


gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc cgtggacccc 


1980 


ctgatgcgcc 


gggcggccaa 


gacggtgaac 


ttcggcgtcc 


tctacggcat gtccgcccat 


2040 


aggctctccc 


aggagcttgc 


catcccctac 


gaggaggcgg 


tggcctttat agagcgctac 


2100 


ttccaaagct 


tccccaaggt 


gcgggcctgg 


atagaaaaga 


ccctggagga ggggaggaag 


2160 


cggggctacg 


tgg^aaccct 


cttcggaaga 


aggcgctacg 


tgcccgacct caacgcccgg 


2220 


gtgaagagcg 


tcagggaggc 


cgcggagcgc 


atggccttca 


acatgcccgt ccagggcacc 


2280 


gccgccgacc 


tcatgaagct 


cgccatggtg 


aagctcttcc 


cccgcctccg ggagatgggg 


2340 


gcccgcatgc 


tcctccaggt 


cgccaacgag 


ctcctcctgg 


aggcccccca agcgcgggcc 


2400 


gaggaggtgg 


cggctttggc 


caaggaggcc 


atggagaagg 


cctatcccct cgccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


gggggaggac 


tggctttccg 


ccaagggt 


2508 



<210> 402 

<211> 836 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 402 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Qlu Pro Lye Gly Arg Val 

15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 
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Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 . 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Tip Leu Trp jjJlu Lys Tyr Gly 
165 170 ^ 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 36S 



Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 
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Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Val Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser« Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 

610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 
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Pro Tyr Glu Qlu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Qlu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 

725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825. 830 

Ser Ala Lys Gly 
835 

<210> 403 

<211> 66 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<220> 

<221> modif ied_base 

<222> (19) . . (51) 

<223> The bases in these positions are a synthesis of 91% base shown an 
d 3% all other bases 



<400> 403 

ctccatcgga acctccttaa gcgcctcgag ggggaggaga agctcctttg gctctaccac 60 
gaggtg 66 



<210> 404 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 
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<223> Synthesis 
<400> 404 



aaggaggttc 


cgatggag 




18 


<210> 405 
<211> 2508 
<212> DNA 
<213> Artificial 






<220> 

<223> Synthetic 


• 




<40U> 405 

atgaattccg 


aggcgatgct tccgctcttt gaacccaaag 


gccgggtcct cctggtggac 


60 


ggccaccacc 


tggcctaccg caccttcttc gccctgaagg 


gcctcaccac gagccggggc 


120 


gaaccggtgc 


aggcggtcta cggcttcgcc aagagcctcc 


tcaaggccct gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt ggtctttgac gccaaggccc 


cctccttccg ccacgaggcc 


240 


tacgaggcct 


acaaggcggg gagggccccg acccccgagg 


acttcccccg gcagctcgcc 


300 


ctcatcaagg 


agctggtgga cctcctgggg tttacccgcc 


tcgaggtccc cggctacgag 


360 


gcggacgacg 


ttctcgccac cctggccaag aaggcggaaa 


aggaggggta cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga cctctaccaa ctcgtctccg 


accgcgtcgc cgtcctccac 


480 


cccgagggcc 


acctcatcac cccggagtgg ctttgggaga 


agtacggcct caggccggag 


540 


cagtgggtgg 


acttccgcgc cctcgtgggg gacccctccg 


acaacctccc cggggtcaag 


600 


ggcatcgggg 


agaagaccgc cctcaagctc ctcaaggagt 


ggggaagcct ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt aaagccagaa aacgtccggg 


agaagatcaa ggcccacctg 


720 


gaagacctca 


ggctctcctt ggagctctcc cgggtgcgca 


ccgacctccc cctggaggtg 


780 


gacctcgccc 


aggggcggga gcccgaccgg gaggggctta 


gggccttcct ggagaggctg 


840 


gagttcggca 


gcctcctcca cgagttcggc ctcctggagg 


cccccgcccc cctggaggag 


900 


gccccctggc 


ccccgccgga aggggccttc gtgggcttcg 


tcctctcccg ccccgagccc 


960 


atgtgggcgg 


agcttaaagc cctggccgcc tgcaggggcg 


gccgcgtgca ccgggcagca 


1020 


gaccccttgg 


cggggctaaa ggacctcaag gaggtccggg 


gcctcctcgc caaggacctc 


1080 


gccgtcttgg 


cctcgaggga ggggctagac ctcgtgcccg 


gggacgaccc catgctcctc 


1140 


gcctacctcc 


tggacccttc gaacaccacc cccgaggggg 


tggcgcggcg ctacgggggg 


1200 


gagtggacgg 


aggacgccgc ccaccgggcc ctcctctcgg 


agaggctcca tcggaacctc 


1260 
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cttaagcgcc tcgagggggt ggagaagctc ctttggctct accacgaggt ggaaaagccc 1320 

ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 1380 

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500 

gagcttaggc ttcccgcctt gaagaagacg aagaagacag gcaagcgctc caccagcgcc 1560 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740 

tccgacccca acctgcagaa catccccgto cgcaccccct tgggccagag gatccgccgg 1800 

gcottcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 

gacatccaoa cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagot tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 
cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 
gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 
gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 
ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggt 



<210> 406 

<211> 836 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 



<400> 406 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 



2220 
2280 



2508 
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20 



25 



30 



Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 



Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
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325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly hen Lys Asp Leu hys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 



His Arg Asn Leu Leu Lys Arg Leu 
420 

Leu Tyr His Glu Val Glu. Lys Pro 
435 440 

Glu Ala Thr Gly Val Arg Arg Asp 
450 455 



Glu Gly Val Glu Lys Leu Leu Trp 
425 430 

Leu Ser Arg Val Leu Ala His Met 
445 

Val Ala Tyr Leu Gin Ala Leu Ser 
460 



Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Al^ Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 

610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
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625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly 
835 



<210> 


407 


<211> 


2508 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


407 



atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 

ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120 

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 

gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240 
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tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 300 

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360 

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 420 

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 480 

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 540 

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 600 

ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660 

. ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 720 

gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 780 

gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 840 

gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 900 

gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960 

atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtgca ccgggcagca 1020 

gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080 

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140 

gcctacctcc tgga'cccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 1200 

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 1260 

cttaagcgaa acgaggggaa ggagaagctc ctttggctct accacgaggt ggaaaagccc 1320 

ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 1380 

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500 

gagcttaggc ttcccgcctt gaagaagacg aagaagacag gcaagcgctc caccagcgcc 1560 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 
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gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggt 2508 



<2i0> 408 

<211> 836 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 408 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 

35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 
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Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 

155 



145 



150 



Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 



195 



200 



205 



Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Asn Glu Gly Lys Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 
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Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 

595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp lie His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 



Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 
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Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly 
835 

<210> 409 

<211> 2508 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 409 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 
ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 
gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 
gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 
tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 
ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 
gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 
atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 
cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 
cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 
ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 
ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 
gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 
gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 
gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



723 



wo 01/90337 



PCTAJSOl/17086 



gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgtgca ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca tcggaacctc 


1260 


cttaagcgct 


tcgaggggga 


ggagaagctc 


ctttgcctct 


accacgaggt ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


ggcgggacgt ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt gctctttgac 


1500 


gagcttaggc 


ttcccgcctt 


gaagaagacg 


aagaagacag 


gcaagcgctc caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt ccacccgagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag gcttagtagc 


1740 


tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


tgggccagag gatccgccgg 


1800 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg 


gtggccctgg 


actatagcca gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca 


gggtcttcca ggaggggaag 


1920 


gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc cgtggacccc 


1980 


ctgatgcgcc 


gggcggccaa 


gacggtgaac 


ttcggcgtcc 


tctacggcat gtccgcccat 


2040 


aggctctccc 


aggagcttgc 


catcccctac 


gaggaggcgg 


tggcctttat agagcgctac 


2100 


ttccaaagct 


tccccaaggt 


gcgggcctgg 


atagaaaaga 


ccctggagga ggggaggaag 


2160 


cggggctacg 


tggaaaccct 


cttcggaaga 


aggcgctacg 


tgcccgacct caacgcccgg 


2220 


gtgaagagcg 


tcagggaggc 


cgcggagcgc 


atggccttca 


acatgcccgt ccagggcacc 


2280 


gccgccgacc 


tcatgaagct 


cgccatggtg 


aagctcttcc 


cccgcctccg ggagatgggg 


2340 


gcccgcatgc 


tcctccaggt 


cgccaacgag 


ctcctcctgg 


aggcccccca agcgcgggcc 


2400 


gaggaggtgg 


cggctttggc 


caaggaggcc 


atggagaagg 


cctatcccct cgccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


gggggaggac 


tggctttccg 


ccaagggt 


2508 



<210> 410 
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<211> 836 
<212> PRT 
<213> Artificial 

<220> 

<222> Synthetic 
<400> 410 

Met Asn ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Qly Leu Thr Thr Ser Arg Gly Glu Pro Val Qln Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

ASP Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

ser ASP Asn Leu Pro Gly Val Lys Gly He Qly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Ast. Arg val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 
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Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275. 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Phe Glu Gly Glu Glu Lys Leu Leu Cys 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 

485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 
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Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 

F-p«M 575 



565 



570 



Arg Leu Ser Ser Ser Asp Pro Asa Leu Gin Asn He Pro Val Arg Thr 



580 



585 



Pro Leu Gly Qln Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 



His lieu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

ASP He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 

655 



645 



650 



Ala val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 

665 670 



660 



Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 



675 



680 



685 



Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Qln Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 . 765 

Met Val Lye Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 



Ser Ala Lys Gly 
835 



<210> 411 
<211> 2508 
<212> DNA 
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<213> Artificial 

<220> 

<223> Synthetic 

<400> 411 



atgaattccg 


aaqcQatcrct 


tccgctcttt 


gaacccaaag 


acCQQQtCCt 

3>>»^333 www w 


CCtQQtQCiac 


60 


99ccaccacc 


tQOfCCtacccf 


caccttcttc 


QCCCtoaaQcr 

3 **3 3 


gcctcaccac 


QaCfCCaQCTQC 

3**3^^y33y ^ 


120 


aaaccQQtQC 


aoacaatcta 


caacttcacc 


aaaaacctcc 


tcaacraccct 


craaQoaaQac 


180 


aQCftacaaacr 


ccgtcttcgt 




accaaaaccc 

^■1 w w ^3 3 ^ 


cctccttcca 


ccacaaacrcc 

w w Bk w y 3 w> w 


240 


t acgacf cf c c t 


ACAAaacaaa 


3 ^33 3 w V* 


aGCfrr'aa act 

ciw w w ^ wy CLy y 


a r* ^ t" f r* rr* na 


QGaCIGf* GQGG 
y way w> wwy w w 


300 


ctcatcaaacf 


aGTG t acrt GTGia 


^3333 


t" t* t a n ffcrcc 


^ r'cracrcr^r^f'r' 


GCfCf G fr* a Gcra ct 
wyy w irCiwy ay 


360 


ctccta3.ccta.ca 


t tctccrccac 


cctcrnr'caaa 


a aacrcacra a a 

CLCiy y ^y y cmn 


acrcraoQOCita 
a.aaeiyyyy i-a 


caacrcr to cere 
*»y «y y ^y ^"•y w 


420 


S^fLCCtLCSf^ CCl 


n cf a cr r« f*T a 




^Uv«yl«v^U-wU>y 


civ,*k»y ^y L>wy o 


wy w w c w w CLW 


480 


cccaaaaacc 


accfccatcac 


^^v^yyeiy '^yy 


\« w k>y y y ciy Gi 


Ay i«ciwyy WW w 


ay y i*y y ay 


540 


cacitaaataQ 


ac 1 1 c CGTC ac 


ccfLca^aaaa 
x#v# i^wy "^y y y y 


CI a rrr* f t*r* GO 

y glvk\«^ v« wy 


aGaacctGGG 


GCTocra^ G a a a 

^3333 i»w»aay 


600 


ciaca t ccfcraa 

a 3 »- « fc- y y y 


acraa.cra^r! cere 


fr* t" a a on 


r» t* fa a aaa fli^ 


CTcrcma a cig g 

33 33**y 


aciaaaaGGt"G 


660 


ctcaagaacc 


wjjy »v*v*yyy w 




vidwy ^v«v«yy y 


acia aaa tGaa 


crcfccGaGCta 

y y WW waw w wy 


720 


gaagacctca 


ggctctcctt 


ggagctctcC 


^333 i-y ^y ^« 


ccaacctccG 


v-wi-yyayy L-y 


780 


gacctcgccc 


acToacrcacrcra 

**3333^333** 


acccaa.ccaa 


y c*y y y y v# u i-o, 


aacfcctfceet 


crcraa aoQC t a 

yyayayywuy 


840 


cracf t fc c era c a 


acctcctcca 

J ^ W W w W W W 0» 


caaat tcaac 


c t c c taaaaa 
^ u V'V^ y y ^yy 




WW uyyay y ay 


900 


accccchaac 


CGccaccacra 

w ^ v* w V* wy y ci 


^3333^'" ^ 


y ^y y y V I* u wy 


uw w IkWIh* wy 


^ ^ w wy ay w ta« ti« 


960 


atcrtoaacacr 


acf c 1 1 aaacrr* 


c c tacre ccig g 


i»y wayyyy vy 


Cf G G Cf GCrf* CTG S 

y wVi^y %»y uy w»a 


G GCf Cf Cf G acf G a 
wy y y '-•ay v» a 


1020 


QaccccttQQ 


cocrcrcrc t a aa 




3*33 *-^*-333 


OGGfcGCfcGaG 
y w w w V w ij wy w 


caaacracctc 

WCm%4 WGiWV^ WW 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


gagcttaggc 


ttcccttttt 


gaagaagacg 


aagaagacag 


gcaagcgctc 


caccagcgcc 


1560 
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2220 
2280 



gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 
cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 
gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 
gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 
ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggt 

<210> 412 

<211> 836 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 412 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

val Phe val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 



2508 
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65 



70 



75 



80 



Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 



Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
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370 375 380 

ASP Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

val Leu Phe Asp Glu Leu Arg Leu Pro Phe Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 ^20 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
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675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly 
835 



<210> 


413 


<211> 


42 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<220> 




<221> 


n 


<222> 


(19)., (21) 


<223> 


n is any base a 


<400> 


413 


caggagctta ggcttcccnn 


<210> 


414 


<211> 


18 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 



42 



732 



wo 01/90337 



PCTAJSOl/17086 



<400> 414 
cctaagctcg tcaaagag 



<210> 415 

<211> 2508 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



18 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



<400> 415 ^ ^ 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 

ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 

gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 

ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660 

ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 720 

gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 780 

gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 840 

gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 900 

gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960 

atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtgca ccgggcagca 1020 

gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080 

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140 

gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 1200 

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 1260 

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 1320 
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ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


ggcgggacgt ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt gctctttgac 


1500 


gagcttaggc 


ttcccgtttt 


gaagaagacg 


aagaagacag 


gcaagcgctc caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt ccacccgagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag gcttagtagc 


1740 


tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


tgggccagag gatccgccgg 


1800 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg 


gtggccctgg 


actatagcca gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca 


gggtcttcca ggaggggaag 


1920 


gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc. 


ccccggaggc cgtggacccc 


1980 


ctgatgcgcc 


gggcggccaa 


gacggtgaac 


ttcggcgtcc 


tctacggcat gtccgcccat 


2040 


aggctctccc 


aggagcttgc 


catcccctac 


gaggaggcgg 


tggcctttat agagcgctac 


2100 


ttccaaagct 


tccccaaggt 


gcgggcctgg 


atagaaaaga 


ccctggagga ggggaggaag 


2160 


cggggctacg 


tggaaaccct 


cttcggaaga 


aggcgctacg 


tgcccgacct caacgcccgg 


2220 


gtgaagagcg 


tcagggaggc 


cgcggagcgc 


atggccttca 


acatgcccgt ccagggcacc 


2280 


gccgccgacc 


tcatgaagct 


cgccatggtg 


aagctcttcc 


cccgcctccg ggagatgggg 


2340 


gcccgcatgc 


tcctccaggt 


cgccaacgag 


ctcctcctgg 


aggcccccca agcgcgggcc 


2400 


gaggaggtgg 


cggctttggc 


caaggaggcc 


atggagaagg 


cctatcccct cgccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


gggggaggac 


tggctttccg 


ccaagggt 


2508 



<210> 416 

<211> 836 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 416 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
1 5 - 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 
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Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Qly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

val Phe val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 



75 



80 



65 70 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 



85 90 
Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 



100 105 

Arg Leu Glu Val Pro Qly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
" 125 



115 120 
Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 



130 



135 



140 



Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 ISO ^55 

pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 

190 



180 



185 



ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 



195 



200 



205 



Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 



210 



215 



220 



AST5 Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 



260 



265 



270 



Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 

315 320 



305 



310 



Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 
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His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 

370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His TUrg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 

435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Val Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 .560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 

595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 
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A8P lie His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
80S 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Qly 





835 


<210> 


417 


<211> 


2508 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


417 



60 



ggccaccacc tggcctaccg caccttcttc gcoctgaagg gcctcaccac gagccggggc 120 
gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 
gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc octccttccg ccacgaggcc 240 
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tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


^caacctccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgtgca 


ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


. 1380 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


gagcttaggc 


ttcccagttt 


gaagaagacg 


aagaagacag 


gcaagcgctc 


caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 




tccacaeeca 


w K» W wnCtW WBKj 






acttaataac 


1740 


tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


tgggccagag 


gatccgccgg 


1800 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg 


gtggccctgg 


actatagcca 


gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca 


gggtcttcca 


ggaggggaag 


1920 


gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc 


cgtggacccc 


1980 
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ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggt 2508 

<210> 418 

<211> 836 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 418 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 



739 



wo 01/90337 



PCT/USOl/17086 



Asp Arg Asp I*eu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 

260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 

420 425 430 



Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 
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Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ser Leu Lys Lys Thr Lys Lys 
500 505 5X0 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 . 745 750 
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Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly 
835 

<210> 419 

<211> 2508 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 419 



atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggcGCt 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 
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gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960 

atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtgca ccgggcagca 1020 

gaccocttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080 

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140 

gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 1200 

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 1260 

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 1320 

ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 1380 

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500 

gagcttaggc ttcccgcctt gaagaagacg aagaagacag gcaagcgctc caccggtgcc 1560 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc I860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 
gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 
ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc totacggcat gtccgcccat 2040 
aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 
ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 
cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220 
gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280 
gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 
gcccgcatgc tcotccaggt cgccaacgag ctcctoctgg aggcccccca agcgcgggcc 2400 
gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 
ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggt 2508 

<210> 420 
<211> 836 
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<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 420 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 

100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
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260 



265 270 



Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
■ 325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 4X5 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly. Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Gly Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
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565 



570 



575 



Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 



Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 



Ser Ala Lys Gly 
835 

<210> 421 

<211> 42 

<212> DNA 

<213> Artificial 
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<220> 

<223> Synthetic 

<220> 

<221> n 

<222> (16).. (18) 

<223> n is any base a,t|C, or g. 



<400> 421 

ggcaagcgct ccaccnnngc cgcggtgctg gaggccctac gg 42 



<210> 422 

<211> 15 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 422 
ggtggagcgc ttgcc 



<210> 423 

<211> 2508 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



<400> 423 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 

ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120 

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 

gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240 

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 300 

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360 

gcggacgacg ttctcgccac cctgg'ccaag aaggcggaaa aggaggggta cgaggtgcgc 420 

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 480 

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 540 

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 600 

ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660 
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ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgtgca 


ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


X080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


gagcttaggc 


ttcccgcctt 


gaagaagacg 


aagaagacag 


gcaagcgctc 


caccagctta 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag 


gcttagtagc 


1740 


tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


tgggccagag 


gatccgccgg 


1800 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg 


gtggccctgg 


actatagcca 


gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca 


gggtcttcca 


aaacfcraQaaQ 


1920 


gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc 


cgtggacccc 


1980 


ctgatgcgcc 


gggcggccaa 


gacggtgaac 


ttcggcgtcc 


tctacggcat 


gtccgcccat 


2040 


aggctctccc 


aggagcttgc 


catcccctac 


gaggaggcgg 


tggcctttat 


agagcgctac 


2100 


ttccaaagct 


tccccaaggt 


gcgggcctgg 


atagaaaaga 


ccctggagga 


ggggaggaag 


2160 


cggggctacg 


tggaaaccct 


cttcggaaga 


aggcgctacg 


tgcccgacct 


caacgcccgg 


2220 


gtgaagagcg 


tcagggaggc 


cgcggagcgc 


atggccttca 


acatgcccgt 


ccagggcacc 


2280 


gccgccgacc 


tcatgaagct 


cgccatggtg 


aagctcttcc 


cccgcctccg 


ggagatgggg 


2340 


gcccgcatgc 


tcctccaggt 


cgccaacgag 


ctcctcctgg 


aggcccccca 


agcgcgggcc 


2400 
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gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 
ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggt 2508 



<210> 424 

<211> 836 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 424 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
1 5 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 
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Asp Arg Val Lys Pro 61u Asn Val Arg Olu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 

485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 51Q 



Thr Gly Lys Arg Ser Thr Ser Leu Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 
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Ala His Pro lie Val Qlu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 

660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 
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Ser Ala Lys Gly 
835 



<210> 


425 


<211> 


42 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<220> 




<221> 


n 


<222> 


(19) . . (21) 


<223> 


n is any base 



<400> 425 

ggcaagcgct ccaccagciin ngcggtgctg gaggccctac gg 42 

<210> 426 

<211> 2508 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 426 



atgaattccg 


aggcgatgct tccgctcttt gaacccaaag gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg caccttcttc gccctgaagg gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


aggcggtcta cggcttcgcc aagagcctcc tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt ggtctttgac gccaaggccc cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg gagggccccg acccccgagg acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga cctcctgggg tttacccgcc tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac cctggccaag aaggcggaaa aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac cccggagtgg ctttgggaga agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc cctcgtgggg gacccctccg acaacctccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc cctcaagctc ctcaaggagt ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt aaagccagaa aacgtccggg agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt ggagctctcc cgggtgcgca ccgacctccc 


cctggaggtg 


780 
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gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 840 

gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 900 

gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960 

atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtgca ccgggcagca 1020 

gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080 

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140 

gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 1200 

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 12 60 

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 1320 

ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 1380 
ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500 

gagcttaggc ttcccgcctt gaagaagacg aagaagacag gcaagcgctc caccagccgt 1560 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg. 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggt 2508 
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<210> 427 

<211> 836 

<212> PRT 

<213> Artificial 



<220> 

<223> Synthetic 



<:400> 427 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 



Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 



Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 



Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 

180 185 . 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 



Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
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245 250 255 

Pro Leu Glu Val Asp Leu Ala Gla Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 

290 295 300 

Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 3X5 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 . 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Arg Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 
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Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 

770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly 
835 

<210> 428 
<211> 2499 
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<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 428 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 
caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 
ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 
gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 
gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 
aaggagctgg tggacctcct ggggctggcg cgcctcgagg tcccgggcta cgaggcggac 
gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 
accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 
gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 
gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 
ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 
aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 
aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 
aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 
agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 
cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 
gatcttctgg ccctggccgc cgccaggggc ggccgcgtcc accgggcccc cgagccttat 
aaagccctca gggacctgaa ggaggcgcgg gggcttctcg ccaaagacct gagcgttctg 
gccctgaggg aaggccttgg cctcccgccc ggcgacgacc ccatgctcct cgcctacctc 
ctggaccctt cgaacaccac ccccgagggg gtggcccggc gctacggcgg ggagtggacg 1200 
gaggaggcgg gggagcgggc cgccctttcc gagaggctct tcgccaacct gcttaagagg 
cttgaggggg aggagaggct cctttggctt taccgggagg tggagaggcc cctttccgct 
gtcctggccc atatggaggc cacgggggtg cgccgggacg tggcctatct cagggccttg 
tccctggagg tggccgagga gatcgcccgc ctcgaggccg aggtcttccg cctggccggc 1440 
caccccttca acctcaactc ccgggaccag ctggaaaggg tcctctttga cgagctaggg 1500 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



. 720 
780 
840 
900 
960 
1020 
1080 
1140 



1260 
1320 

1380 
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cttcccgcca 


tcaagaagac 


gcaaaagacc 


ggcaagcgct ccaccagcgc cgccgtcctg 


1560 


gaggccctcc 


gcgaggccca 


ccccatcgtg 


gagaagatcc tgcagtaccg ggagctcacc 


1620 


aagctgaaga 


gcacctacat 


tgaccccttg 


ccggacctca tccaccccag gacgggccgc 


1680 


ctccacaccc 


gcttcaacca 


gacggccacg 


gccacgggca ggctaagtag ctccgatccc 


1740 


aacctccaga 


acatccccgt 


ccgcaccccg 


cttgggcaga ggatccgccg ggccttcatc 


1800 


gccgaggagg 


ggtggctatt 


ggtggccctg 


gactatagcc agatagagct cagggtgctg 


1860 


gcccacctct 


ccggcgacga 


gaacctgatc 


cgggtcttcc aggaggggcg ggacatccac 


1920 


acggagaccg 


ccagctggat 


gttcggcgtc 


ccccgggagg ccgtggaccc cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggggtc 


ctctacggca tgtcggccca ccgcctctcc 


2040 


caggagctag 


ccatccctta 


cgaggaggcc 


caggccttca ttgagcgcta ctttcagagc 


2100 


ttccccaagg 


tgcgggcctg 


gattgagaag 


accctggagg agggcaggag gcgggggtac 


2160 


gtggagaccc 


tcttcggccg 


ccgccgctac 


gtgccagacc tagaggcccg ggtgaagagc 


2220 


gcgcgggagg 


cggccgagcg 


catggccttc 


aacat^gcccg tccagggcac cgccgccgac 


2280 


ctcatgaagc 


tggctatggt 


gaagctcttc 


cccaggctgg aggaaatggg ggccaggatg 


2340 


ctccttcagg 


tcgccaacga 


gctggtcctc 


gaggccccaa aagagagggc ggaggccgtg 


2400 


gcccggctgg 


ccaaggaggt 


catggagggg 


gtgtatcccc tggccgtgcc cctggaggtg 


2460 


gaggtgggga 


taggggagga 


ctggctctcc 


gccaaggag 


2499 



<210> 429 

<211> 833 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 429 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val He 
50 55 60 
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Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Leu Ala Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu lie Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
- 210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Pro Glu Pro Tyr Lys Ala Leu Arg Asp Leu Lys Glu Ala Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ser Val Leu Ala Leu Arg Glu Gly Leu Gly Leu 
355 360 365 
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Pro Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 

370 375 380 

Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
385 390 395 400 

Glu Glu Ala Gly Glu Arg Ala Ala Leu Ser Glu Arg Leu Phe Ala Asn 
405 410 415 

Leu Leu Lys Arg Leu Glu Gly Glu Glu Arg Leu Leu Trp Leu Tyr Arg 
420 425 430 

Glu Val Glu Arg Pro Leu Ser Ala Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Arg Asp Val Ala Tyr Leu Arg Ala Leu Ser Leu Glu Val 
450 455 460 

Ala Glu Glu He Ala Arg Leu Glu Ala Glu Val Phe Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala He Lys Lys Thr Gin Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

He Val Glu Lys He Leu Gin Tyr Arg Glu Leu Thr Lys Leu Lys Ser 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Asp Leu He His Pro Arg Thr Gly Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 
580 585 590 

Gin Arg He Arg Arg Ala Phe He Ala Glu Glu Gly Trp Leu Leu Val 
595 600 605 

Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Arg Asp He His 
625 630 635 640 

Thr Glu Thr Ala Ser Trp Met Phe Gly Val Pro Arg Glu Ala Val Asp 
645 650 655 

Pro Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 
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Gly Met Ser Ala Hie Arg Leu Ser Gin Glu Leu Ala lie Pro Tyr Glu 
675 680 685 

Glu Ala Gin Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val 
690 695 700 

Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Arg Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Glu Ala 
725 730 735 

Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Glu Glu Met Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

Ala Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Arg Ala Glu Ala Val 
785 790 795 800 

Ala Arg Leu Ala Lys Glu Val Met Glu Gly Val Tyr Pro Leu Ala Val 
805 810 815 

Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Glu 



<210> 


430 


<211> 


27 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


430 



acgggggtgc gccgggacgt ggcctat 



<210> 


431 


<211> 


2508 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


431 



atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 
ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 
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gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


Saggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgtgca 


ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


gagcttaggc 


ttcccaagtt 


gaagaagacg 


aagaagacag 


gcaagcgctc 


caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct 


c cage ac egg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag 


gcttagtagc 


1740 


tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


tgggccagag 


gatccgccgg 


1800 
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gccttcgtgg 


ccgaggcggg 


ttgggcgttg gtggccctgg 


actatagcca gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


cggggacgaa aacctgatca 


gggtcttcca ggaggggaag 


1920 


gacatccaca 


cccagaccgc 


aagctggatg ttcggcgtcc 


ccccggaggc cgtggacccc 


1980 


ctgatgcgcc 


gggcggccaa 


gacggtgaac ttcggcgtcc 


tctacggcat gtccgcccat 


2040 


aggctctccc 


aggagcttgc 


catcccctac gaggaggcgg 


tggcctttat agagcgctac 


2X00 


ttccaaagct 


tccccaaggt 


gcgggcctgg atagaaaaga 


ccctggagga ggggaggaag 


2160 


cggggctacg 


tggaaaccct 


cttcggaaga aggcgctacg 


tgcccgacct caacgcccgg 


2220 


gtgaagagcg 


tcagggaggc 


cgcggagcgc atggccttca 


acatgcccgt ccagggcacc 


2280 


gccgccgacc 


tcatgaagct 


cgccatggtg aagctcttcc 


cccgcctccg ggagatgggg 


2340 


gcccgcatgc 


tcctccaggt 


cgccaacgag ctcctcctgg 


aggcccccca agcgcgggcc 


2400 


gaggaggtgg 


cggctttggc 


caaggaggcc atggagaagg 


cctatcccct cgccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


gggggaggac tggctttccg 


ccaagggt 


2508 



<210> 432 

<211> 836 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 432 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
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115 



120 



125 



Ala hys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 

260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 

370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 



His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
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420 



425 



430 



Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 



Val Leu Phe Asp Glu Leu Arg Leu Pro Lys Leu Lys Lys Thr Lys Lys 
500 50S 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 

595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 



765 



wo 01/90337 



PCT/USOl/17086 



725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly 
835 



<210> 


433 


<211> 


36 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


433 


cttaggcttc ccaagttgaa 


<210> 


434 


<211> 


36 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


434 


tgtcttcttc gtcttcttca 


<210> 


435 


<211> 


2508 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


435 



36 



36 



atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 
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ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 




aqgcqqtcta 

33^33*****"** 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 

3 "^3i> 


180 


qqqtacaaqq 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctcctt^ccg 


ccacgaggcc 


240 


tacqaqqcct 


acaaqqcaaa 
**^'*"3 3^yy 3 


gagggcCCCg 


acccccgagg 


acttcccccg 


qcaqctcqcc 


300 


cbcatcaa99 


aqctqqtqqa 


CCtCCtqqqq 

WW www W J J J J 


tttacccgcc 


tcgaggtccc 


CQQCtacaaq 


360 


acQQacaaco 


ttctCQCcac 

V w w^iJ ^^^^ 


cctaaccaaa 

WW Wg»j WWMM^ 


a aaacaaaaa 


dyy dyyyy wd 


caaaahacac 

y ay y v*y w 


420 


atcctcaccg 


ccaaccacaa 


cctctaccaa 


ctcabctcca 


aCCOCQtCQC 


cgtcctccac 


480 


CCCQSiCfOQCC 


acctcatcac 


^*»^3yay *»33 


c 1 1 1 gggaga 


acrtacacrcck 


w wy y w w *j y siy 


540 




acttccaecrc 


cctcatacrcra 
WW w wy ^y y y y 


aacccctcccr 

yWWWWW (oWWViJ 


acaacctccG 

vi W vLw& W W WW WW 


^^yyy 3 ^v-ddy 


600 




ay day ciw v«y Vi» 


w w ^ wddy w u> w 


c t r* a a cicjaa t 

w L> waay M y 


yyyy^'^y^^ ^ 


acraaaacpt" c 

y y aaaaw w w w 


660 


r* t* f a a era a o 


i-y y dv*v*y y y l. 


a a a ar* a na a 
d ddy w w dy d d 


dd^y wwwyyy 


aGraacfat'caa 
dy ddy d^wdd 


yy Vi^wwawwuy 


720 


01 w w u w d 




yydyctc tuc 


^333 ''3^3^* 


^^y dww w^ww 


Cwt-yydgy ug 


/ 0 V 


a r« r« f*r* n o r« r» 


a-yyygcggga 


gcccgdccgg 


gaggggc tea 


gggccttcct 


ggagaggccg 






y CCUwCtCCd 


cgag u c c ggc 


^M^a/*m 

cccccggagg 


cccccgcwcc 


ccuggaggag 






w \« wy w eg y d 


dygggc c c c c 


gugggcu ucg 


CCwUCuCCCy 


CwCwgagcwC 






dy ^ L. Udddy 


i^wuy yu. wy w w 


i-gcdggggcg 


gcugcgtgcd 


ccgggcdgcd 




era n p rr* t* t" on 

Cl V> i« OVjVj 


*^y y y y ^ waaa 


Cfoa or* t" ?5 rt 
yya^w u>wdciy 


ydygL.^uyyy 


y ^ u w ^ L> wy w 


wddy y ai' w uw 


1080 






y yyy way aw 


Gfccafccrccca 

wwwvjVwjgwwwy 


cracfar*aaPCo 
y yy awy aw w w 


wa wy w w w w ^ w 


1140 


acetacctcc 


taaacccthc 


cra.acaccacc 

y aawaw wa WW 


ut^^ydyyyyy 


L.gyv*yi»ay^g 


t.duy yyyy y 


1200 


y i»yy «^yy 


a aoa for* r*rir« 
dyydv»y w*.»y w 


w^d^^yy y ^w 


v« l»w t vyy 


a fra nrrr* t" pp a 
dy dyy w wwd 


i» oy y ddLr L* u 


X A U V 


^ U U Ady w>y 


ccgaggggga 


ggagddgctw 


wu buggctct 


a a ^^a ^i^f !■ 

accacgaggu 


J n/ ■ a a a a nn n 

ggadddgccc 


1 9 fl 




uww uggcccd 


udcggaggcc 


dwcggggcdw 


ggcgggacgu 


ggccudCCuu 


1 m n 
X3 0 u 




cccuggagc u 


tgcggaggag 


auccgccgcc 


t cgaggagga 


ggcccuccgc 




ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


gagcttaggc 


ttcccgcctt 


gaagaagacg 


aagaagacag 


gcaagcgctc 


caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggsLggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 


acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag 


gcttagtagc 


1740 
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tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


tgggccagag gatccgccgg 


1800 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg 


gtggccctgg 


actatagcca gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca 


gggtcttcca ggaggggaag 


1920 


gacatcgcca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc cgtggacccc 


1980 


ctgatgcgcc 


999cggccaa 


gacggtgaac 


ttcggcgtcc 


tctacggcat gtccgcccat 


2040 


aggctctccc 


aggagcttgc 


catcccctac 


gaggaggcgg 


tggcctttat agagcgctac 


2100 


ttccaaagct 


tccccaaggt 


gcgggcctgg 


atagaaaaga 


ccctggagga ggggaggaag 


2160 


cggggctacg 


tggaaaccct 


cttcggaaga 


aggcgctacg 


tgcccgacct caacgcccgg 


2220 


gtgaagagcg 


tcagggaggc 


cgcggagcgc 


atggccttca 


acatgcccgt ccagggcacc 


2280 


gccgccgacc 


tcatgaagct 


cgccatggtg 


aagctcttcc 


cccgcctccg ggagatgggg 


2340 


gcccgcatgc 


tcctccaggt 


cgccaacgag 


ctcctcctgg 


aggcccccca agcgcgggcc 


2400 


gaggaggtgg 


cggctttggc 


caaggaggcc 


atggagaagg 


cctatcccct cgccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


gggggaggac 


tggctttccg 


ccaagggt 


2508 



<210> 436 

<211> 836 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 436 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 
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Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 



Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 
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His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Net 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 

His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He Ala Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 



Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 
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Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly 
835 



<210> 


437 


<211> 


33 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


437 



gaggggaagg acatcgccac ccagaccgca age 33 



<210> 438 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 438 

gcttgcggtc tgggtggcga tgtccttccc etc 33 



<210> 439 

<211> 2508 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
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<400> 439 



atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


9a99g9Ctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgtgca 


ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg 


gcctcctcgc 


caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg 


gggacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg 


tggcgcggcg 


ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg 


agaggctcca 


tcggaacctc 


1260 


cttaagcgcc 


tcgaggggga 


ggagaagctc 


ctttggctct 


accacgaggt 


ggaaaagccc 


1320 


ctctcccggg 


tcctggccca 


tatggaggcc 


accggggtac 


ggcgggacgt 


ggcctacctt 


1380 


caggcccttt 


ccctggagct 


tgcggaggag 


atccgccgcc 


tcgaggagga 


ggtcttccgc 


1440 


ttggcgggcc 


accccttcaa 


cctcaactcc 


cgggaccagc 


tggaaagggt 


gctctttgac 


1500 


gagcttaggc 


ttcccgcctt 


gaagaagccg 


aagaagacag 


gcaagcgctc 


caccagcgcc 


1560 


gcggtgctgg 


aggccctacg 


ggaggcccac 


cccatcgtgg 


agaagatcct 


ccagcaccgg 


1620 


gagctcacca 


agctcaagaa 


cacctacgtg 


gaccccctcc 


caagcctcgt 


ccacccgagg 


1680 
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acgggccgcc 


tccacacccg 


cttcaaccag 


acggccacgg 


ccacggggag gcttagtagc 


1740 


tccgacccca 


acctgcagaa 


catccccgtc 


cgcaccccct 


tgggccagag gatccgccgg 


1800 


gccttcgtgg 


ccgaggcggg 


ttgggcgttg 


gtggccctgg 


actatagcca gatagagctc 


1860 


cgcgtcctcg 


cccacctctc 


cggggacgaa 


aacctgatca 


gggtcttcca ggaggggaag 


1920 


gacatccaca 


cccagaccgc 


aagctggatg 


ttcggcgtcc 


ccccggaggc cgtggacccc 


1980 


ctgatgcgcc 


gggcggccaa 


gacggtgaac 


ttcggcgtcc 


tctacggcat gtccgcccat 


2040 


aggctctccc 


aggagcttgc 


catcccctac 


gaggaggcgg 


tggcctttat agagcgctac 


2100 


ttccaaagct 


tccccaaggt 


gcgggcctgg 


atagaaaaga 


ccctggagga ggggaggaag 


2160 


cggggctacg 


tggaaaccct 


cttcggaaga 


aggcgctacg 


tgcccgacct caacgcccgg 


2220 


gtgaagagcg 


tcagggaggc 


cgcggagcgc 


atggccttca 


acatgcccgt ccagggcacc 


2280 


gccgccgacc 


tcatgaagct 


cgccatggtg 


aagctcttcc 


cccgcctccg ggagatgggg 


2340 


gcccgcatgc 


tcctccaggt 


cgccaacgag 


ctcctcctgg 


aggcccccca agcgcgggcc 


2400 


gaggaggtgg 


cggctttggc 


caaggaggcc 


atggagaagg 


cctatcccct cgccgtgccc 


2460 


ctggaggtgg 


aggtggggat 


gggggaggac 


tggctttccg 


ccaagggt 


2508 



<210> 440 

<211> 836 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 440 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 
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Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 



Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 
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Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Qlu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Pro Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro lie Val Glu Lys lie Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg lie Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin lie Glu Leu Arg Val Leu Ala 

610 615 620 

His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp lie His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala lie 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe lie Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 
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Pro Lys Val Arg Ala Trp lie Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly 
835 



<210> 


441 


<211> 


33 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


441 



cccgccttga agaagccgaa gaagacaggc aag 33 



<210> 


442 


<211> 


33 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


442 



cttgcctgtc ttcttcggct tcttcaaggc ggg 33 



<210> 443 

<211> 2499 

<212> DNA 

<213> Artificial 

<220> 
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<223> Synthetic 
<400> 443 

atgaattcgg ggatgctgcc cctctttgag cccaagggcc gggtcctcct ggtggacggc 60 

caccacctgg cctaccgcac cttccacgcc ctgaagggcc tcaccaccag ccggggggag 120 

ccggtgcagg cggtctacgg cttcgccaag agcctcctca aggccctcaa ggaggacggg 180 

gacgcggtga tcgtggtctt tgacgccaag gccccctcct tccgccacga ggcctacggg 240 

gggtacaagg cgggccgggc ccccacgccg gaggactttc cccggcaact cgccctcatc 300 

aaggagctgg tggacctcct ggggttcacg cgcctcgagg tcccgggcta cgaggcggac 360 

gacgtcctgg ccagcctggc caagaaggcg gaaaaggagg gctacgaggt ccgcatcctc 420 

accgccgaca aagaccttta ccagctcctt tccgaccgca tccacgtcct ccaccccgag 480 

gggtacctca tcaccccggc ctggctttgg gaaaagtacg gcctgaggcc cgaccagtgg 540 

gccgactacc gggccctgac cggggacgag tccgacaacc ttcccggggt caagggcatc 600 

ggggagaaga cggcgaggaa gcttctggag gagtggggga gcctggaagc cctcctcaag 660 

aacctggacc ggctgaagcc cgccatccgg gagaagatcc tggcccacat ggacgatctg 720 

aagctctcct gggacctggc caaggtgcgc accgacctgc ccctggaggt ggacttcgcc 780 

aaaaggcggg agcccgaccg ggagaggctt agggcctttc tggagaggct tgagtttggc 840 

agcctcctcc acgagttcgg ccttctggaa agccccaagg ccctggagga ggccccctgg 900 

cccccgccgg aaggggcctt cgtgggcttt gtgctttccc gcaaggagcc catgtgggcc 960 

gatcttctgg ccctggccgc cgccaggggc ggccgcgttt accgggcgga ggatcccttg 1020 

gaggccttgc gggggcttgg ggaggtgagg gggcttttgg ccaaggacct ggcggtgctg 1080 

gccctgaggg aagggattgc cctggcaccg ggcgacgacc ccatgctcct cgcctacctc 1140 

ctggatcctt ccaacaccgc ccccgaaggg gtagcccggc gctacggggg ggagtggacc 1200 

gaggaggcgg gggaaagggc gctgctttcc gaaaggcttt acgccgccct cctgaagcgg 1260 

cttaaggggg aggagaggct tctttggctt tacgaggagg tggaaaagcc cctttcgcgg 1320 

gtcctggccc acatggaggc cacgggggta cggttggatg tggcctactt aaaggccctt 13 80 

tccctggagg tggaggcgga gataaggcgc ttcgaggagg aggtccaccg cctggccggg 1440 

catcctttca acctgaactc ccgggaccag ctggaaaggg tcatctttga cgagcttggg 1500 

cttcccgcca tcaagaagac gaggaagacg ggcaagcgct ccaccagcgc cgccgttttg 1560 

gaggccttgc gggaggctca tcccatcgtg gaccgcatcc ttcagtaccg ggagctttcc 1620 
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aagctcaagg 


gaacctacat 


cgatcccttg 


cctgccctgg tccaccccaa gacgaaccgc 


1680 


ctccacaccc 


gtttcaacca 


gacggccacc 


gccacgggga ggcttagcag ctcggatcct 


1740 


aatctgcaaa 


atatccccgt 


gcgcacccct 


ttgggccagc ggatccgccg ggccttcgtg 


1800 


gccgaggagg 


ggtggaggct 


ggtggttttg 


gactacagcc agattgagct cagggtcctg 


1860 


gcgcaccttt 


ccggggacga 


gaacctaatc 


cgggtcttcc aggagggcca ggacatccac 


1920 


acccagacgg 


ccagctggat 


qttcggcgtg 


cccccagagg ccgtggattc cctgatgcgc 


1980 


cgggcggcca 


agaccatcaa 


cttcggcgtc 


ctctacggca bgtccgccca ccggctt^bcg 


2040 


ggagagctgg 


ccatccccta 


ccfaqqaQQCQ 


gtggcctfcca tcgagcggta tttccagagc 


2100 


taccccaagg 


tgcgggcctg 


gattgagaaa 


accctOQcqcr aaQQacQciQa acQOcrcrctiat 


2160 


gtggaaaccc 


tctttggccg 


ccggcgctac 


qtcrccccract tQorcttccco acrtcraaQaac 


2220 


atccgggagg 


cagcggagcg 


catggccttc 


aacatgccgg tccaggggac cgccgcggat 


2280 


ttgatgaaac 


tggccatggt 


gaagctcttt 


cccaggcttc aggagctggg ggccaggatg 


2340 


cttttgcagg 


tgcacaacga 


actggtcctc 


gaggctccca aggagcaagc ggaggaagtc 


2400 


gcccaggagg 


ccaagcggac 


catggaggag 


gtgtggcccc tgaaggtgcc cttggaggtg 


2460 


gaagtgggca 


tcggggagga 


ctggctttcc 


gccaaggcc 


2499 



<210> 444 

<211> 833 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 444 

Met Asn Ser Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu 
15 10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
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85 



90 



95 



Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg lie His Val Leu His Pro Glu 
145 150 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 

Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala He Arg Glu Lys He Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser TVrg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val Tyr Arg Ala 
325 330 335 

Glu Asp Pro Leu Glu Ala Leu Arg Gly Leu Gly Glu Val Arg Gly Leu 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly He Ala Leu 
355 360 365 

Ala Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu Asp Pro Ser 
370 375 380 



Asn Thr Ala Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Trp Thr 
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385 



390 



395 



400 



Glu Glu Ala Gly Glu Arg Ala Leu Leu Ser Glu Arg Leu Tyr Ala Ala 
405 4X0 415 

Leu Leu Lys Arg Leu Lys Gly Glu Glu Arg Leu Leu Trp Leu Tyr Glu 
420 425 430 

Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Thr 
435 440 445 

Gly Val Arg Leu Asp Val Ala Tyr Leu Lys Ala Leu Ser Leu Glu Val 
450 455 460 

Glu Ala Glu lie Arg Arg Phe Glu Glu Glu Val His Arg Leu Ala Gly 
465 470 475 480 

His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val lie Phe 
485 490 495 

Asp Glu Leu Gly Leu Pro Ala lie Lys Lys Thr Arg Lys Thr Gly Lys 
500 505 510 

Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu Ala His Pro 
515 520 525 

lie Val Asp Arg He Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys Gly 
530 535 540 

Thr Tyr He Asp Pro Leu Pro Ala Leu Val His Pro Lys Thr Asn Arg 
545 550 555 560 

Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr- Gly Arg Leu Ser 
565 570 575 

Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly 

580 585 590 

Gin Arg He Arg Arg Ala Phe Val Ala Glu Glu Gly Trp Arg Leu Val 
595 600 605 

Val Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser 
610 615 620 

Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Gin Asp He His 
625 630 635 640 

Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu Ala Val Asp 
645 650 655 

Ser Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly Val Leu Tyr 
660 665 670 

Gly Met Ser Ala His Arg Leu Ser Gly Glu Leu Ala He Pro Tyr Glu 
675 680 685 



Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Tyr Pro Lys Val 
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690 695 700 

Arg Ala Trp He Qlu Lys Thr Leu Ala Glu Gly Arg Glu Arg Gly Tyr 
705 710 715 720 

Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Ala Ser 
725 730 735 

Arg Val Lys Ser He Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met 
740 745 750 

Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala Met Val Lys 
755 760 765 

Leu Phe Pro Arg Leu Gin Glu Leu Gly Ala Arg Met Leu Leu Gin Val 
770 775 780 

His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Gin Ala Glu Glu Val 
785 790 795 800 

Ala Gin Glu Ala Lys Arg Thr Met Glu Glu Val Trp Pro Leu Lys Val 
805 810 815 

Pro Leu Qlu Val Glu Val Gly He Gly Glu Asp Trp Leu Ser Ala Lys 
820 825 830 

Ala 



<210> 445 

<211> 2496 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 



<400> 445 
atgaattcgg 


ggatgctgcc 


cctctttgag 


cccaagggcc 


gggtcctcct 


ggtggacggc 


60 


caccacctgg 


cctaccgcac 


cttccacgcc 


ctgaagggcc 


tcaccaccag 


ccggggggag 


120 


ccggtgcagg 


cggtctacgg 


cttcgccaag 


agcctcctca 


aggccctcaa 


ggaggacggg 


180 


gacgcggtga 


tcgtggtctt 


tgacgccaag 


gccccctcct 


tccgccacga 


ggcctacggg 


240 


gggtacaagg 


cgggccgggc 


ccccacgccg 


gaggactttc 


cccggcaact 


cgccctcatc 


300 


aaggagctgg 


tggacctcct 


ggggttcacg 


cgcctcgagg 


tcccgggcta 


cgaggcggac 


360 


gacgtcctgg 


ccagcctggc 


caagaaggcg 


gaaaaggagg 


gctacgaggt 


ccgcatcctc 


420 


accgccgaca 


aagaccttta 


ccagctcctt 


tccgaccgca 


tccacgtcct 


ccaccccgag 


480 


gggtacctca 


tcaccccggc 


ctggctttgg 


gaaaagtacg 


gcctgaggcc 


cgaccagtgg 


540 


gccgactacc 


gggccctgac 


cggggacgag 


tccgacaacc 


ttcccggggt 


caagggcatc 


600 
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ggggagaaga 


cggcgaggaa 


gcttctggag 


gagtggggga 


gcctggaagc 


cctcctcaag 


660 


aacctggacc 


ggctgaagcc 


cgccatccgg 


gagaagatcc 


tggcccacat 


ggacgatctg 


720 


aagctctcct 


gggacctggc 


caaggtgcgc 


accgacctgc 


ccctggaggt 


ggacttcgcc 


780 


aaaaggcggg 


agcccgaccg 


ggagaggctt 


agggcctttc 


tggagaggct 


tgagtttggc 


840 


agcctcctcc 


acgagttcgg 


ccttctggaa 


agccccaagg 


ccctggagga 


ggccccctgg 


900 


cccccgccgg 


aaggggcctt 


cgtgggcttt 


gtgctttccc 


gcaaggagcc 


catgtgggcc 


960 


gatcttctgg 


ccctggccgc 


cgccaggggc 


ggccgcgtcc 


accgggcaac 


aagcccggtt 


1020 


gaggccctgg 


ccgacctcaa 


ggaggcccgg 


gggttcctgg 


ccaaggacct 


ggccgttttg 


1080 


gccctgcggg 


agggggtggc 


cctggacccc 


acggacgacc 


ccctcctggt 


ggcctacctc 


1140 


ctggacccgg 


ccaacaccca 


ccccgagggg 


gtggcccggc 


gctacggggg 


cgagttcacg 


1200 


gaggacgcag 


cggagagggc 


cctcctctcc 


gagaggctct 


tccagaacct 


ctttaaacgg 


1260 


ctttccgaga 


agctcctctg 


gctctaccag 


gaggtggagc 


ggcccctctc 


ccgggtcttg 


1320 


gcccacatgg 


aggcccgggg 


ggtgaggctg 


gacgtccccc 


ttctggaggc 


cctctccttt 


1380 


gagctggaga 


aggagatgga 


gcgcctggag 


ggggaggtct 


tccgtttggc 


cggccacccc 


1440 


ttcaacctca 


actcccgcga 


ccagctggaa 


agggtcctct 


ttgacgagct 


gggcctcacc 


1500 


ccggtgaagc 


ggacgaagaa 


gacgggcaag 


cgctccaccg 


cccagggggc 


cctggaggcc 


1560 


ctccgggggg 


cccaccccat 


cgtggagctc 


atcctccagt 


accgggagct 


ttccaagctc 


1620 


aaaagcacct 


acctggaccc 


cctgccccgg 


ctcgtccacc 


cgcggacggg 


ccggctccac 


1680 


acccgcttca 


accagacggc 


cacggccacg 


ggaaggcttt 


ccagctccga 


ccccaacctg 


1740 


cagaacatcc 


ccgtgcgcac 


ccccttgggg 


cagcgcatcc 


gcaaggcctt 


cgtggccgag 


1800 


gaggggtggc 


tccttttggc 


ggcggactac 


tcccagattg 


agctccgggt 


cctggcccac 


1860 


ctctcggggg 


acgagaacct 


gaagcgggtc 


ttccgggagg 


ggaaggacat 


ccataccgag 


1920 


accgccgcct 


ggatgttcgg 


cttagacccc 


gctctggtgg 


atccaaagat 


gcgccgggcg 


1980 


gccaagacgg 


tcaacttcgg 


cgtcctctac 


gggatgtccg 


cccacaggct 


ctcccaggag 


2040 


ctcggcatag 


actacaagga 


ggcggaggcc 


tttattgagc 


gctacttcca 


gagcttcccc 


2100 


aaggtgcggg 


cctggataga 


aaggaccctg 


gaggagggcc 


ggacgcgggg 


ctacgtggag 


2160 


accctgttcg 


gcaggaggcg 


ctatgtgccc 


gacctggcct 


cccgggtccg 


ctcggtgcgg 


2220 


gaggcggcgg 


agcggatggc 


cttcaacatg 


cccgtgcagg 


gcaccgccgc 


cgacctgatg 


2280 
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aagatcgcca tggtcaagct cttccccagg ctaaagcccc tgggggccca cctcctcctc 2340 

caagtggcca acgagctggt cctggaggtg cccgaggacc gggccgagga ggccaaggcc 2400 

ctggtcaagg aggtcatgga gaacgcctac cccctggacg tgcccctcga ggtggaggtg 2460 

ggcgtgggtc gggactggct ggaggcgaag caggat 2496 



<210> 


446 


<211> 


832 


<212> 


PRT 


<213> 


Artificial 


<220> 


• 


<223> 


Synthetic 


<400> 


446 


Met Asn Ser Gly Met 


X 


5 



10 15 

Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe His Ala Leu Lys 
20 25 30 

Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly Phe 
35 40 45 

Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Asp Ala Val lie 
50 55 60 

Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala Tyr Gly 
65 70 75 80 

Gly Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro Arg Gin 
85 90 95 

Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr Arg Leu 
100 105 110 

Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Ser Leu Ala Lys 
115 120 125 

Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala Asp Lys 
130 135 140 

Asp Leu Tyr Gin Leu Leu Ser Asp Arg He His Val Leu His Pro Glu 
145 ISO 155 160 

Gly Tyr Leu He Thr Pro Ala Trp Leu Trp Glu Lys Tyr Gly Leu Arg 
165 170 175 

Pro Asp Gin Trp Ala Asp Tyr Arg Ala Leu Thr Gly Asp Glu Ser Asp 
180 185 190 

Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Arg Lys Leu 
195 200 205 
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Leu Glu Glu Trp Gly Ser Leu Glu Ala Leu Leu Lys Asn Leu Asp Arg 
210 215 220 

Leu Lys Pro Ala lie Arg Glu Lys lie Leu Ala His Met Asp Asp Leu 
225 230 235 240 

Lys Leu Ser Trp Asp Leu Ala Lys Val Arg Thr Asp Leu Pro Leu Glu 
245 250 255 

Val Asp Phe Ala Lys Arg Arg Glu Pro Asp Arg Glu Arg Leu Arg Ala 
260 265 270 

Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu Phe Gly Leu 
275 280 285 

Leu Glu Ser Pro Lys Ala Leu Glu Glu Ala Pro Trp Pro Pro Pro Glu 
290 295 300 

Gly Ala Phe Val Gly Phe Val Leu Ser Arg Lys Glu Pro Met Trp Ala 
305 310 315 320 

Asp Leu Leu Ala Leu Ala Ala Ala Arg Gly Gly Arg Val His Arg Ala 
325 330 335 

Thr Ser Pro Val Glu Ala Leu Ala Asp Leu Lys Glu Ala Arg Gly Phe 
340 345 350 

Leu Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly Val Ala Leu 
355 360 365 

Asp Pro Thr Asp Asp Pro Leu Leu Val Ala Tyr Leu Leu Asp Pro Ala 

370 375 380 

Asn Thr His Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly Glu Phe Thr 
385 390 395 400 

Glu Asp Ala Ala Glu Arg Ala Leu Leu Ser Glu Arg Leu Phe Gin Asn 
405 410 415 

Leu Phe Lys Arg Leu Ser Glu Lys Leu Leu Trp Leu Tyr Gin Glu Val 
420 425 430 

Glu Arg Pro Leu Ser Arg Val Leu Ala His Met Glu Ala Arg Gly Val 
435 440 445 

Arg Leu Asp Val Pro Leu Leu Glu Ala Leu Ser Phe Glu Leu Glu Lys 
450 455 460 

Glu Met Glu Arg Leu Glu Gly Glu Val Phe Arg Leu Ala Gly His Pro 
465 470 475 480 

Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu Phe Asp Glu 

485 490 495 



Leu Gly Leu Thr Pro Val Lys Arg Thr Lys Lys Thr Gly Lys Arg Ser 
500 505 510 
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Thr Ala Gin Gly Ala Leu Glu Ala Leu Arg Gly Ala His Pro He Val 
515 520 525 

Glu Leu He Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys Ser Thr Tyr 
530 535 540 

Leu Asp Pro Leu Pro Arg Leu Val His Pro Arg Thr Gly Arg Leu His 
545 550 555 560 

Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser 
565 570 575 

Asp Pro T^n Leu Gin Asn He Pro Val Arg Thr Pro Leu Gly Gin Arg 
580 585 590 

He Arg Lys Ala Phe Val Ala Glu Glu Gly Trp Leu Leu Leu Ala Ala 
595 600 605 

Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu Ser Gly Asp 
610 615 620 

Glu Asn Leu Lys Arg Val Phe Arg Glu Gly Lys Asp He His Thr Glu 
625 630 635 640 

Thr Ala Ala Trp Met Phe Gly Leu Asp Pro Ala Leu Val Asp Pro Lys 
645 650 655 

Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu Tyr Gly Met 
660 665 670 

Ser Ala His Arg Leu Ser Gin Glu Leu Gly He Asp Tyr Lys Glu Ala 
675 680 685 

Glu Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys Val Arg Ala 
690 695 700 

Trp He Glu Arg Thr Leu Glu Glu Gly Arg Thr Arg Gly Tyr Val Glu 
705 710 715 720 

Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Ala Ser Arg Val 
725 730 735 

Arg Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn Met Pro Val 
740 745 750 

Gin Gly Thr Ala Ala Asp Leu Met Lys He Ala Met Val Lys Leu Phe 

755 760 765 

Pro Arg Leu Lys Pro Leu Gly Ala His Leu Leu Leu Gin Val Ala Asn 
770 775 780 

Glu Leu Val Leu Glu Val Pro Glu Asp Arg Ala Glu Glu Ala Lys Ala 
785 790 795 800 



Leu Val Lys Glu Val Met Glu Asn Ala Tyr Pro Leu Asp Val Pro Leu 
805 810 815 
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Glu Val Glu Val Gly Val Gly Arg Asp Trp Leu Glu Ala Lys Gin Asp 
820 825 830 

<210> 447 
<211> 2508 
. <212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 447 



atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag 


gccgggtcct 


cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg 


gcctcaccac 


gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc 


tcaaggccct 


gaaggaggac 


180 


gggtacaagg 


ccgtcttcgt 


ggtctttgac 


gccaaggccc 


cctccttccg 


ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


gagggccccg 


acccccgagg 


acttcccccg 


gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgggg 


tttacccgcc 


tcgaggtccc 


cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa 


aggaggggta 


cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg 


accgcgtcgc 


cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga 


agtacggcct 


caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtgggg 


gacccctccg 


acaacctccc 


cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaaggagt 


ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc 


tggaccgggt 


aaagccagaa 


aacgtccggg 


agaagatcaa 


ggcccacctg 


720 


gaagacctca 


ggctctcctt 


ggagctctcc 


cgggtgcgca 


ccgacctccc 


cctggaggtg 


780 


gacctcgccc 


aggggcggga 


gcccgaccgg 


gaggggctta 


gggccttcct 


ggagaggctg 


840 


gagttcggca 


gcctcctcca 


cgagttcggc 


ctcctggagg 


cccccgcccc 


cctggaggag 


900 


gccccctggc 


ccccgccgga 


aggggccttc 


gtgggcttcg 


tcctctcccg 


ccccgagccc 


960 


atgtgggcgg 


agcttaaagc 


cctggccgcc 


tgcaggggcg 


gccgcgttta 


ccgggcggag 


1020 


gatcccttgg 


aggccttgcg 


ggggcttggg 


gaggtgaggg 


ggcttttggc 


caaggacctg 


1080 


gcggtgctgg 


ccctgaggga 


agggattgcc 


ctggcaccgg 


gcgacgaccc 


catgctcctc 


1140 


gcctacctcc 


tggatccttc 


caacaccgcc 


cccgaagggg 


tagcccggcg 


ctacgggggg 


1200 


gagtggaccg 


aggaggcggg 


ggaaagggcg 


ctgctttccg 


aaaggcttta 


cgccgccctc 


1260 


ctgaagcggc 


ttaaggggga 


ggagaggctt 


ctttggcttt 


acgaggaggt 


ggaaaagccc 


1320 
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ctttcgcggg tcctggccca catggaggcc acgggggtac ggttggatgt ggcctactta 1380 

aaggcccttt ccctggaggt ggaggcggag ataaggcgct tcgaggagga ggtccaccgc 1440 

ctggccgggc atcctttcaa cctgaactcc cgggaccagc tggaaagggt catctttgac 1500 

gagcttgggc ttcccgccat caagaagacg aggaagacgg gcaagcgctc caccagcgcc 1560 

gccgttttgg aggccttgcg ggaggctcat cccatcgtgg accgcatcct tcagtaccgg 1620 

gagctttcca agctcaaggg aacctacatc gatcccttgc ctgccctggt ccaccccaag 1680 

acgaaccgcc tccacacccg tttcaaccag acggccaccg ccacggggag gcttagcagc 1740 

tcggatccta atctgcaaaa tatccccgtg cgcacccctt tgggccagcg gatccgccgg 1800 

gccttcgtgg ccgaggaggg gtggaggctg gtggttttgg actacagcca gattgagctc 1860 

agggtcctgg cgcacctttc cggggacgag aacctaatcc gggtcttcca ggagggccag 1920 

gacatccaca cccagacggc cagctggatg ttcggcgtgc ccccagaggc cgtggattcc 1980 

ctgatgcgcc gggcggccaa gaccatcaac ttcggcgtcc tctacggcat gtccgcccac 2040 

cggctttcgg gagagctggc catcccctac gaggaggcgg tggccttcat cgagcggtat 2100 

ttccagagct accccaaggt gcgggcctgg attgagaaaa ccctggcgga aggacgggaa 2160 

cggggctatg tggaaaccct ctttggccgc cggcgctacg tgcccgactt ggcttcccgg 2220 

gtgaagagca tccgggaggc agcggagcgc atggccttca acatgccggt ccaggggacc 2280 

gccgcggatt tgatgaaact ggccatggtg aagctctttc ccaggcttca ggagctgggg 2340 

gccaggatgc ttttgcaggt gcacaacgaa ctggtcctcg aggctcccaa ggagcaagcg 2400 

gaggaagtcg cccaggaggc caagcggacc atggaggagg tgtggcccct gaaggtgccc 2460 

ttggaggtgg aagtgggcat cggggaggac tggctttccg ccaaggcc 2508 

<210> 448 

<211> 836 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 448 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
^ S 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 
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Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg lie Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly lie Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys lie Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 
305 310 315 320 



Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 
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Tyr Arg Ala Glu Asp Pro Leu Glu Ala Leu Arg Gly Leu Gly Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly 
355 360 365 

lie Ala Leu Ala Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Ala Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Glu Ala Gly Glu Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

Tyr Ala Ala Leu Leu Lys Arg Leu Lys Gly Glu Glu Arg Leu Leu Trp 
420 425 430 

Leu Tyr Glu Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Leu Asp Val Ala Tyr Leu Lys Ala Leu Ser 
450 455 460 

Leu Glu Val Glu Ala Glu He Arg Arg Phe Glu Glu Glu Val His Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val He Phe Asp Glu Leu Gly Leu Pro Ala He Lys Lys Thr Arg Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Asp Arg He Leu Gin Tyr Arg Glu Leu Ser Lys 
530 535 540 

Leu Lys Gly Thr Tyr He Asp Pro Leu Pro Ala Leu Val His Pro Lys 
545 550 555 560 

Thr Asn Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn lie Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Glu Gly Trp 
595 600 605 

Arg Leu Val Val Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 



His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Gin 
625 630 635 640 
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Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 

645 650 655 

Ala Val Asp Ser Leu Met Arg Arg Ala Ala Lys Thr He Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gly Glu Leu Ala lie 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Tyr 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Ala Glu Gly Arg Glu 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Ala Ser Arg Val Lys Ser He Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 

755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Gin Glu Leu Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val His Asn Glu Leu Val Leu Glu Ala Pro Lys Glu Gin Ala 
785 790 795 800 

Glu Glu Val Ala Gin Glu Ala Lys Arg Thr Met Glu Glu Val Trp Pro 
805 810 815 

Leu Lys Val Pro Leu Glu Val Glu Val Gly He Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Ala 
835 

<210> 449 

<211> 2505 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 449 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 

ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120 

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 

gggtacaagg ccgtcttcgt ggtctttgac gccaaggccc cctccttccg ccacgaggcc 240 

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 300 
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ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360 

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 420 

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 4 80 

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 540 

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 600 

ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660 

ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 720 

gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 780 

gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 840 

gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 900 

gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960 

atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtcca ccgggcaaca 1020 

agcccggttg aggccctggc cgacctcaag gaggcccggg ggttcctggc caaggacctg 1080 

gccgttttgg ccctgcggga gggggtggcc ctggacccca cggacgaccc cctcctggtg 1140 

gcctacctcc tggacccggc caacacccac cccgaggggg tggcccggcg ctacgggggc 1200 

gagttcacgg aggacgcagc ggagagggcc ctcctctccg agaggctctt ccagaacctc 1260 

tttaaacggc tttccgagaa gctcctctgg ctctaccagg aggtggagcg gcccctctcc X320 

cgggtcttgg cccacatgga ggcccggggg gtgaggctgg acgtccccct tctggaggcc 1380 

ctctcctttg agctggagaa ggagatggag cgcctggagg gggaggtctt ccgtttggcc 1440 

ggccacccct tcaacctcaa ctcccgcgac cagctggaaa gggtcctctt tgacgagctg 1500 

ggcctcaccc cggtgaagcg gacgaagaag acgggcaagc gctccaccgc ccagggggcc 1560 

ctggaggccc tccggggggc ccaccccatc gtggagctca tcctccagta ccgggagctt 1620 

tccaagctca aaagcaccta cctggacccc ctgccccggc tcgtccaccc gcggacgggc 1680 

cggctccaca cccgcttcaa ccagacggcc acggccacgg gaaggctttc cagctccgac 1740 

cccaacctgc agaacatccc cgtgcgcacc cccttggggc agcgcatccg caaggccttc 1800 

gtggccgagg aggggtggct ccttttggcg gcggactact cccagattga gctccgggtc 1860 

ctggcccacc tctcggggga cgagaacctg aagcgggtct tccgggaggg gaaggacatc 1920 

cataccgaga ccgccgcctg gatgttcggc ttagaccccg ctctggtgga tccaaagatg 1980 
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cgccgggcgg ccaagacggt caacttcggc gtcctctacg ggatgtccgc ccacaggctc 2040 

tcccaggagc tcggcataga ctacaaggag gcggaggcct ttattgagcg ctacttccag 2100 

agcttcccca aggtgcgggc ctggatagaa aggaccctgg aggagggccg gacgcggggc 2160 

tacgtggaga ccctgttcgg caggaggcgc tatgtgcccg acctggcctc ccgggtccgc 2220 

tcggtgcggg aggcggcgga gcggatggcc ttcaacatgc ccgtgcaggg caccgccgcc 2280 

gacctgatga agatcgccat ggtcaagctc ttccccaggc taaagcccct gggggcccac 2340 

ctcctcctcc aagtggccaa cgagctggtc ctggaggtgc ccgaggaccg ggccgaggag 2400 

gccaaggccc tggtcaagga ggtcatggag aacgcctacc ccctggacgt gcccctcgag 2460 

gtggaggtgg gcgtgggtcg ggactggctg gaggcgaagc aggat 2505 



<210> 450 

<211> 835 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 450 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 

35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val Phe Val Val Phe Asp Ala Lys Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu 'Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
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145 



150 



155 



160 



Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 1B5 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Glu Asn Val Arg Glu Lys He Lys Ala His Leu 
225 230 235 240 

Glu Asp Leu Arg Leu Ser Leu Glu Leu Ser Arg Val Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Leu Ala Gin Gly Arg Glu Pro Asp Arg Glu Qly 
260 265 270 

Leu Arg Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ala Pro Ala Pro Leu Glu Glu Ala Pro Trp Pro 
290 295 300 

Pro Pro Glu Gly Ala Phe Val Gly Phe Val Leu Ser Arg Pro Glu Pro 

310 315 320 

Met Trp Ala Glu Leu Lys Ala Leu Ala Ala Cys Arg Gly Gly Arg Val 
325 330 335 

His Arg Ala Thr Ser Pro Val Glu Ala Leu Ala Asp Leu Lys Glu Ala 
340 345 350 

Arg Gly Phe Leu Ala Lys Asp Leu Ala Val Leu Ala Leu Arg Glu Gly 
355 360 365 

Val Ala Leu Asp Pro Thr Asp Asp Pro Leu Leu Val Ala Tyr Leu Leu 
370 375 380 



Asp Pro Ala Asn Thr His Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Phe Thr Glu Asp Ala Ala Glu Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

Phe Gin Asn Leu Phe Lys Arg Leu Ser Glu Lys Leu Leu Trp Leu Tyr 
420 425 430 

Gin Glu Val Glu Arg Pro Leu Ser Arg Val Leu Ala His Met Glu Ala 
435 440 445 

Arg Gly val Arg Leu Asp Val Pro Leu Leu Glu Ala Leu Ser Phe Glu 
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450 455 460 

Leu Glu Lys Glu Met Glu Arg Leu Glu Gly Glu Val Phe Krg Leu Ala 
465 470 475 480 

Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg Val Leu 
485 490 495 

Phe Asp Glu Leu Gly Leu Thr Pro Val Lys Arg Thr Lys Lys Thr Gly 
500 505 510 

Lys Arg Ser Thr Ala Gin Gly Ala Leu Glu Ala Leu Arg Gly Ala His 
515 520 525 

Pro He Val Glu Leu He Leu Gin Tyr Arg Glu Leu Ser Lys Leu Lys 
530 535 540 

Ser Thr Tyr Leu Asp Pro Leu Pro Arg Leu Val His Pro Arg Thr Gly 
545 550 555 560 

Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly Arg Leu 
565 570 575 

Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr Pro Leu 
580 585 590 

Gly Gin Arg He Arg Lys Ala Phe Val JVla Glu Glu Gly Trp Leu Leu 
595 600 605 

Leu Ala Ala Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala His Leu 
610 615 620 

Ser Gly Asp Glu Asn Leu Lys Arg Val Phe Arg Glu Gly Lys Asp He 
625 630 635 640 

His Thr Glu Thr Ala Ala Trp Met Phe Gly Leu Asp Pro Ala Leu Val 
645 650 655 

Asp Pro Lys Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly Val Leu 
660 665 670 

Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Gly He Asp Tyr 
675 680 685 

Lys Glu Ala Glu Ala Phe He Glu Arg Tyr Phe Gin Ser Phe Pro Lys 
690 695 700 

Val Arg Ala Trp He Glu Arg Thr Leu Glu Glu Gly Arg Thr Arg Gly 
705 710 715 720 

Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp Leu Ala 
725 730 735 

Ser Arg Val Arg Ser Val Arg Glu Ala Ala Glu Arg Met Ala Phe Asn 
740 745 750 

Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys He Ala Met Val 
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755 760 



765 



Lys Leu Phe Pro Arg Leu Lys Pro Leu Gly Ala His Leu Leu Leu Gin 
770 775 780 

Val Ala Asn Glu Leu Val Leu Glu Val Pro Glu Asp Arg Ala Glu Glu 
'^^^ 790 795 eOO 

Ala Lys Ala Leu Val Lys Glu Val Met Glu Asn Ala Tyr Pro Leu Asp 
805 810 815 

Val Pro Leu Glu Val Glu Val Gly Val Gly Arg Asp Trp Leu Glu Ala 
820 825 830 

Lys Gin Asp 
835 

<210> 451 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 451 

cttctctcat ccgccaaaac agcc 24 

<210> 452 

<211> 2526 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 452 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 

ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120 

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 

gggtacaagg ccgtcatcgt ggtctttgac gccgaggccc cctccttccg ccacgaggcc 240 

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 300 

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360 

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 420 

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 480 

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct caggccggag 540 

cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc cggggtcaag 600 
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ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct ggaaaacctc 660 

ctcaagaacc tggaccgggt aaagccagaa aacgtccggg agaagatcaa ggcccacctg 720 

gaagacctca ggctctcctt ggagctctcc cgggtgcgca ccgacctccc cctggaggtg 780 

gacctcgccc aggggcggga gcccgaccgg gaggggctta gggccttcct ggagaggctg 840 

gagttcggca gcctcctcca cgagttcggc ctcctggagg cccccgcccc cctggaggag 900 

gccccctggc ccccgccgga aggggccttc gtgggcttcg tcctctcccg ccccgagccc 960 

atgtgggcgg agcttaaagc cctggccgcc tgcaggggcg gccgcgtgca ccgggcagca 1020 

gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080 

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140 

gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 1200 

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 1260 

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 1320 

ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 1380 

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500 

gagcttaggc ttcccgcctt gaagaagacg aagaagacag gcaagcgctc caccagcgcc 1560 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280 
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gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 2520 

caccac 2526 

<210> 453 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 453 

ctcctccacg agttcggc 18 

<210> 454 

<211> 2514 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 454 

atgaattccc tgcccctctt tgagcccaag ggccgggtgc ttctggtgga cggccaccac 60 

ctggcctacc gtaccttttt tgccctgaag ggcct caeca ccagccgcgg ggagccggtc 120 

caggcggtgt acgggtttgc caagagcctt ttgaaggcgc taagggaaga cggggatgtg 180 

gtgatcgtgg tgtttgacgc caaggccccc tccttccgcc accagaccta cgaggcctac 240 

aaggcggggc gggctcccac ccccgaggac tttccccggc agcttgccct tatcaaggag 300 

atggtggacc ttttgggctt tacccgcctc gaggtgccgg gctttgaagc ggatgacgtc 360 

ctggctaccc tggccaagaa ggcggaaaag gaaggctacg aagtgcgcat cctcaccgcg 420 

gaccgggacc tttaccagct tctttcggag cgaatctcca tccttcaccc ggagggttac 480 

ctgatcaccc cggagtggct ttgggagaag tatgggctta agccttccca gtgggtggac 540 

taccgggcct tggccgggga cccttccgac aacatccccg gcgtgaaggg catcggggag 600 

aagacggcgg ccaagctgat ccgggagtgg ggaagcctgg aaaaccttct taagcacctg 660 

gaacaggtga aacctgcctc cgtgcgggag aagatcctta gccacatgga ggacctcaag 720 

ctatccctgg agctatcccg ggtgcacacg gacttgctcc ttcaggtgga cttcgcccgg 780 
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cgccgggagc 


cggaccggga 


ggggcttaag 


gcctttttgg 


agaggctgga 


gttcggaagc 


840 


ctcctccacg 


agttcggcct 


gttggaaagc 


ccggtggcgg 


cggaggaagc 


tccctggccg 


900 


ccccccgagg 


gagccttcgt 


ggggtacgtt 


ctttcccgcc 


ccgagcccat 


gtgggcggag 


960 


cttaacgcct 


tggccgccgc 


ctggggcggc 


cgcgtgcacc 


gggcagcaga 


ccccttggcg 


1020 


gggctaaagg 


acctcaagga 


ggtccggggc 


ctcctcgcca 


aggacctcgc 


cgtcttggcc 


1080 


tcgagggagg 


ggctagacct 


cgtgcccggg 


gacgacccca 


tgctcctcgc 


ctacctcctg 


1140 


gacccttcga 


acaccacccc 


cgagggggtg 


gcgcggcgct 


acggggggga 


gtggacggag 


1200 


gacgccgccc 


accgggccct 


cctctcggag 


aggctccatc 


ggaacctcct 


taagcgcctc 


1260 


gagggggagg 


agaagctcct 


ttggctctac 


cacgaggtgg 


aaaagcccct 


ctcccgggtc 


1320 


ctggcccata 


tggaggccac 


cggggtacgg 


cgggacgtgg 


cctaccttca 


ggccctttcc 


1380 


ctggagcttg 


cggaggagat 


ccgccgcctc 


gaggaggagg 


tcttccgctt 


ggcgggccac 


1440 


cccttcaacc 


tcaactcccg 


ggaccagctg 


gaaagggtgc 


tctttgacga 


gcttaggctt 


1500 


cccgccttga 


agaagacgaa 


gaagacaggc 


aagcgctcca 


ccagcgccgc 


ggtgctggag 


1560 


gccctacggg 


aggcccaccc 


catcgtggag 


aagatcctcc 


agcaccggga 


gctcaccaag 


1620 


ctcaagaaca 


cctacgtgga 


ccccctccca 


agcctcgtcc 


acccgaggac 


gggccgcctc 


1680 


cacacccgct 


tcaaccagac 


ggccacggcc 


acggggaggc 


ttagtagctc 


cgaccccaac 


1740 


ctgcagaaca 


tccccgtccg 


cacccccttg 


ggccagagga 


tccgccgggc 


cttcgtggcc 


1800 


gaggcgggtt 


gggcgttggt 


ggccctggac 


tatagccaga 


tagagctccg 


cgtcctcgcc 


1860 


cacctctccg 


gggacgaaaa 


cctgatcagg 


gtcttccagg 


aggggaagga 


catccacacc 


1920 


cagaccgcaa 


gctggatgtt 


cggcgtcccc 


ccggaggccg 


tggaccccct 


gatgcgccgg 


1960 


gcggccaaga 


cggtgaactt 


cggcgtcctc 


tacggcatgt 


ccgcccatag 


gctctcccag 


2040 


gagcttgcca 


tcccctacga 


ggaggcggtg 


gcctttatag 


agcgctactt 


ccaaagcttc 


2100 


cccaaggtgc 


gggcctggat 


agaaaagacc 


ctggaggagg 


ggaggaagcg 


gggctacgtg 


2160 


gaaaccctct 


tcggaagaag 


gcgctacgtg 


cccgacctca 


acgcccgggt 


gaagagcgtc 


2220 


agggaggccg 


cggagcgcat 


ggccttcaac 


atgcccgtcc 


agggcaccgc 


cgccgacctc 


2280 


atgaagctcg 


ccatggtgaa 


gctcttcccc 


cgcctccggg 


agatgggggc 


ccgcatgctc 


2340 


ctccaggtcg 


ccaacgagct 


cctcctggag 


gccccccaag 


cgcgggccga 


ggaggtggcg 


2400 


gctttggcca 


aggaggccat 


ggagaaggcc 


tatcccctcg 


ccgtgcccct 


ggaggtggag 


2460 
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gtggggatgg gggaggactg gctttccgcc aagggtcacc accaccacca ccac 2514 

<210> 455 
<211> 2526 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 455 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 

ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120 

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 

gggtacaagg ccgtcatcgt ggtctttgac gccgaggccc cctccttccg ccacgaggcc 240 

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 300 

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc cggctacgag 360 

gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta cgaggtgcgc 420 

atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc cgtcctccac 480 

cccgagggcc acctcatcac cccggagtgg ctttgggaga agtatgggct taagccttcc 540 

cagtgggtgg actaccgggc cttggccggg gacccttccg acaacatccc cggcgtgaag 600 

ggcatcgggg agaagacggc ggccaagctg atccgggagt ggggaagcct ggaaaacctt 660 

cttaagcacc tggaacaggt gaaacctgcc tccgtgcggg agaagatcct tagccacatg 720 

gaggacctca agctatccct ggagctatcc cgggtgcaca cggacttgct ccttcaggtg 780 

gacttcgccc ggcgccggga gccggaccgg gaggggctta aggccttttt ggagaggctg 840 

gagttcggaa gcctcctcca cgagttcggc ctgttggaaa gcccggtggc ggcggaggaa 900 

gctccctggc cgccccccga gggagccttc gtggggtacg ttctttcccg ccccgagccc 960 

atgtgggcgg agcttaacgc cttggccgcc gcctggggcg gccgcgtgca ccgggcagca 1020 

gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080 

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140 

gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 1200 

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 1260 

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 1320 

ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 1380 
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caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500 

gagcttaggc ttcccgcctt gaagaagacg aagaagacag gcaagcgctc caccagcgcc 1560 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 222 0 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 2520 

caccac 2526 



<210> 456 

<211> 842 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 456 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 
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Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val He Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala 
^5 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Lys Pro Ser Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro 
100 185 190 

Ser Asp Asn He Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Ala 
195 200 205 

Lys Leu He Arg Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys His Leu 

210 215 220 

Glu Gin Val Lys Pro Ala Ser Val Arg Glu Lys He Leu Ser His Met 
225 230 235 240 

Glu Asp Leu Lys Leu Ser Leu Glu Leu Ser Arg Val His Thr Asp Leu 
245 250 255 

Leu Leu Gin Val Asp Phe Ala Arg Arg Arg Glu Pro Asp Arg Glu Gly 
260 265 270 

Leu Lys Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ser Pro Val Ala Ala Glu Glu Ala Pro Trp Pro 
290 295 300 



Pro Pro Glu Gly Ala Phe Val Gly Tyr Val Leu Ser Arg Pro Glu Pro 

310 315 320 

Met Trp Ala Glu Leu Asn Ala Leu Ala Ala Ala Trp Gly Gly Arg Val 
325 330 335 
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His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 

370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 4X0 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val Phe Arg 
H65 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 

Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 

595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
610 615 620 



His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 
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Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 

710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 
755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
'^^5 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 

840 





835 


<210> 


457 


<211> 


27 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


457 



ttccaggtgc ttgaggaggt tttccag 27 



<210> 458 

<211> 27 

<212> DNA 

<213> Artificial 
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<220> 

<223> Synthetic 
<400> 458 



ctcctcaagc 


acctggaaca 


ggtgaaa 




27 


<210> 459 
<211> 2526 
<212> DNA 
<213> Artificial 








<220> 

<223> Synthetic 








<400> 459 
atgaattccg 


aggcgatgct 


tccgctcttt 


gaacccaaag gccgggtcct cctggtggac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc 


gccctgaagg gcctcaccac gagccggggc 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc 


aagagcctcc tcaaggccct gaaggaggac 


180 


gggtacaagg 


ccgtcatcgt 


ggtctttgac 


gccgaggccc cctccttccg ccacgaggcc 


240 


tacgaggcct 


acaaggcggg 


Qagggccccq 


acccccgagg acttcccccg gcagctcgcc 


300 


ctcatcaagg 


agctggtgga 


cctcctgqqQ 


tttacccgcc tcgaggtccc cggctacgag 


360 


gcggacgacg 


ttctcgccac 


cctggccaag 


aaggcggaaa aggaggggta cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa 


ctcgtctccg accgcgtcgc cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccggagtgg 


ctttgggaga agtacggcct caggccggag 


540 


cagtgggtgg 


acttccgcgc 


cctcgtagqg 

^ ^^^^ 


gacccctccg acaacctccc cggggtcaag 


600 


ggcatcgggg 


agaagaccgc 


cctcaagctc 


ctcaagqagt ggggaagcct ggaaaacctc 


660 


ctcaagcacc 


tggaacaggt 


gaaacctgcc 


tccgtgcggg agaagatcct tagccacatg 


720 


gaggacctca 


agctatccct 


ggagctatcc 


cqqqtqcaca cqqacttqct ccttcaqqtq 


780 


gacttcgccc 


ggcgccggga 


qcccrqaccqq 

3^^3Z3**^^33 


qaqqqqctta aqqccttttt qqaqaggctq 

3 "3 j33 ** **** **33^ ** ^ ** 33 3 33 3 


840 


gagttcggaa 


gcctcctcca 


cgagttcggc 


ctgttggaaa gcccggtggc ggcggaggaa 


900 


gctccctggc 


cgccccccga 


gggagccttc 


gtggggtacg ttctttcccg ccccgagccc 


960 


atgtgggcgg 


agcttaacgc 


cttggccgcc 


gcctggggcg gccgcgtgca ccgggcagca 


1020 


gaccccttgg 


cggggctaaa 


ggacctcaag 


gaggtccggg gcctcctcgc caaggacctc 


1080 


gccgtcttgg 


cctcgaggga 


ggggctagac 


ctcgtgcccg gggacgaccc catgctcctc 


1140 


gcctacctcc 


tggacccttc 


gaacaccacc 


cccgaggggg tggcgcggcg ctacgggggg 


1200 


gagtggacgg 


aggacgccgc 


ccaccgggcc 


ctcctctcgg agaggctcca tcggaacctc 


1260 
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cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 1320 

ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 1380 

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500 

gagcttaggc ttcccgcctt gaagaagacg aagaagacag gcaagcgctc caccagcgcc 1560 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2220 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 2520 

2526 

<210> 460 
<211> 842 
<212> PRT 
<213> Artificial 

<220> 

<223> Synthetic 
<400> 460 

Met Asn Ser Qlu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
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10 



15 



Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val He Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 

145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys His Leu 
210 215 220 

Glu Gin Val Lys Pro Ala Ser Val Arg Glu Lys He Leu Ser His Met 
225 230 235 240 

Glu Asp Leu Lys Leu Ser Leu Glu Leu Ser Arg Val His Thr Asp Leu 
245 250 255 

Leu Leu Gin Val Asp Phe Ala Arg Arg Arg Glu Pro Asp Arg Glu Gly 

260 265 270 

Leu Lys Ala Phe Leu Glu Arg Leu Glu Phe Gly Ser Leu Leu His Glu 
275 280 285 

Phe Gly Leu Leu Glu Ser Pro Val Ala Ala Glu Glu Ala Pro Trp Pro 
290 295 300 



Pro Pro Glu Gly Ala Phe Val Gly Tyr Val Leu Ser Arg Pro Glu Pro 
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305 



310 



315 



320 



Met Trp Ala Glu Leu Asn Ala Leu Ala Ala Ala Trp Gly Gly Arg Val 
325 330 335 

His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys Glu Val 
340 345 350 

Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg Glu Gly 
355 360 365 

Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr Leu Leu 
370 375 380 

Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr Gly Gly 
385 390 395 400 

Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu Arg Leu 
405 410 415 

His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu Leu Trp 
420 425 430 

Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala His Met 
435 440 445 

Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala Leu Ser 
450 455 460 

Leu Glu Leu Ala Glu Glu He Arg Arg Leu Glu Glu Glu Val Phe Arg 
465 470 475 480 

Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu Glu Arg 
485 490 495 

Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr Lys Lys 
500 505 510 

Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu Arg Glu 
515 520 525 

Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu Thr Lys 
530 535 540 



Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His Pro Arg 
545 550 555 560 

Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala Thr Gly 
565 570 575 

Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val Arg Thr 
580 585 590 

Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala Gly Trp 
595 600 605 

Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val Leu Ala 
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610 615 620 

His Leu Ser Gly Asp Glu Asn Leu lie Arg Val Phe Gin Glu Gly Lys 
625 630 635 640 

Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro Pro Glu 
645 650 655 

Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn Phe Gly 
660 665 670 

Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu Ala He 
675 680 685 

Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin Ser Phe 
690 695 700 

Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly Arg Lys 
705 710 715 720 

Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val Pro Asp 
725 730 735 

Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg Met Ala 
740 745 750 

Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys Leu Ala 

755 760 765 

Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg Met Leu 
770 775 780 

Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala Arg Ala 
785 790 795 800 

Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala Tyr Pro 
805 810 815 

Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp Trp Leu 
820 825 830 

Ser Ala Lys Gly His His His His His His 
835 840 



<210> 


461 


<211> 


27 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


461 



gtccaggttc ttgaggaggt tttccag 27 



<210> 462 
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<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 462 

ctcctcaaga acctggaccg ggtaaag 27 

<210> 463 

<211> 2532 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 463 



atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct 


cctggtggac 


60 


ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac 


gagccggggc 


120 


gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct 


gaaggaggac 


180 


gggtacaagg ccgtcatcgt ggtctttgac gccgaggccc cctccttccg 


ccacgaggcc 


240 


tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg 


gcagctcgcc 


300 


ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtccc 


cggctacgag 


360 


gcggacgacg ttctcgccac cctggccaag aaggcggaaa aggaggggta 


cgaggtgcgc 


420 


atcctcaccg ccgaccgcga cctctaccaa ctcgtctccg accgcgtcgc 


cgtcctccac 


480 


cccgagggcc acctcatcac cccggagtgg ctttgggaga agtacggcct 


caggccggag 


540 


cagtgggtgg acttccgcgc cctcgtgggg gacccctccg acaacctccc 


cggggtcaag 


600 


ggcatcgggg agaagaccgc cctcaagctc ctcaaggagt ggggaagcct 


ggaaaacctc 


660 


ctcaagaacc tggaccgggt aaagccggac tcgctccggc gcaagataga 


ggcgcacctc 


720 


gaggacctcc acctctcctt agacctggcc cgcatccgca ccgacctccc 


cctggaggtg 


780 


gactttaagg ccctgcgccg caggaccccc gacctggagg gcctgagggc 


ctttttggag 


840 


gagctggagt tcggaagcct cctccacgag ttcggcctcc tgggagggga 


gaagccccgg 


900 


gaggaggccc cctggccccc gcccgaaggg gccttcgtgg gcttcctcct 


ttcccgcaag 


960 


gagcccatgt gggcggagct tctggccctg gcggcggcct cgggcggccg 


cgtgcaccgg 


1020 


gcagcagacc ccttggcggg gctaaaggac ctcaaggagg tccggggcct 


cctcgccaag 


1080 


gacctcgccg tcttggcctc gagggagggg ctagacctcg tgcccgggga 


cgaccccatg 


1140 
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ctcctcgcct 


acctcctgga 


cccttcgaac 


accacccccg agggggtggc gcggcgctac 


1200 


gggggggagt 


ggacggagga 


cgccgcccac 


cgggccctcc tctcggagag gctccatcgg 


1260 


aacctcctta 


agcgcctcga 


gggggaggag 


aagctccttt ggctctacca cgaggtggaa 


1320 


aagcccctct 


cccgggtcct 


ggcccatatg 


gaggccaccg gggtacggcg ggacgtggcc 


1380 


taccttcagg 


ccctttccct 


ggagcttgcg 


gaggagatcc gccgcctcga ggaggaggtc 


1440 


ttccgcttgg 


cgggccaccc 


cttcaacctc 


aactcccggg accagctgga aagggtgctc 


1500 


tttgacgagc 


ttaggcttcc 


cgccttgaag 


aagacgaaga agacaggcaa gcgctccacc 


1560 


agcgccgcgg 


tgctggaggc 


cctacgggag 


gcccacccca tcgtggagaa gatcctccag 


1620 


caccgggagc 


tcaccaagct 


caagaacacc 


tacgtggacc ccctcccaag cctcgtccac 


1680 


ccgaggacgg 


gccgcctcca 


cacccgcttc 


aaccagacgg ccacggccac ggggaggctt 


1740 


agtagctccg 


accccaacct 


gcagaacatc 


cccgtccgca cccccttggg ccagaggatc 


1800 


cgccgggcct 


tcgtggccga 


ggcgggttgg 


gcgttggtgg ccctggacta tagccagata 


1860 


gagctccgcg 


tcctcgccca 


cctctccggg 


gacgaaaacc tgatcagggt cttccaggag 


1920 


gggaaggaca 


tccacaccca 


gaccgcaagc 


tggatgttcg gcgtcccccc ggaggccgtg 


1980 


gaccccctga 


tgcgccgggc 


ggccaagacg 


gtgaacttcg gcgtcctcta cggcatgtcc 


2040 


gcccataggc 


tctcccagga 


gcttgccatc 


ccctacgagg aggcggtggc ctttatagag 


2100 


cgctacttcc 


aaagcttccc 


caaggtgcgg 


gcctggatag aaaagaccct ggaggagggg 


2160 


aggaagcggg 


gctacgtgga 


aaccctcttc 


ggaagaaggc gctacgtgcc cgacctcaac 


2220 


gcccgggtga 


agagcgtcag 


ggaggccgcg 


gagcgcatgg ccttcaacat gcccgtccag 


2280 


ggcaccgccg 


ccgacctcat 


gaagctcgcc 


atggtgaagc tcttcccccg cctccgggag 


2340 


atgggggccc 


gcatgctcct 


ccaggtcgcc 


aacgagctcc tcctggaggc cccccaagcg 


2400 


cgggccgagg 


aggtggcggc 


tttggccaag 


gaggccatgg agaaggccta tcccctcgcc 


2460 


gtgcccctgg 


aggtggaggt 


ggggatgggg 


gaggactggc tttccgccaa gggtcaccac 


2520 


caccaccacc 


ac 






2532 



<210> 464 

<211> 844 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 



810 



wo 01/90337 PCT/USOl/17086 



<400> 464 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Qly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val lie Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala 
^5 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu lie Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Arg Pro Glu Gin Trp Val Asp Phe Arg Ala Leu Val Gly Asp Pro 
180 185 190 

Ser Asp Asn Leu Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Lys Leu Leu Lys Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Asp Ser Leu Arg Arg Lys He Glu Ala His Leu 
225 230 235 240 

Glu Asp Leu His Leu Ser Leu Asp Leu Ala Arg He Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Phe Lys Ala Leu Arg Arg Arg Thr Pro Asp Leu 
260 265 270 

Glu Gly Leu Arg Ala Phe Leu Glu Glu Leu Glu Phe Gly Ser Leu Leu 
275 280 285 
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His Glu Phe Gly Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro 
290 295 300 

Trp Pro Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu Ser Arg Lys 
305 310 315 320 

Glu Pro Met Trp Ala Glu Leu Leu Ala Leu Ala Ala Ala Ser Gly Gly 
325 330 335 

Arg Val His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys 
340 345 350 

Glu Val Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg 

355 360 365 

Glu Gly Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr 
370 375 380 

Leu Leu Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr 
385 390 395 400 

Gly Gly Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu 
405 410 415 

Arg Leu His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu 
420 425 430 

Leu Trp Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala 
435 440 445 

His Met Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala 
450 455 460 

Leu Ser Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val 
465 470 475 480 

Phe Arg Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu 
485 490 495 

Glu Arg Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr 
500 505 510 

Lys Lys Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu 
515 520 525 

Arg Glu Ala His Pro lie Val Glu Lys He Leu Gin His Arg Glu Leu 
530 535 540 

Thr Lys Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His 
545 550 555 560 

Pro Arg Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala 
565 570 575 



Thr Gly Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val 
580 585 590 
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Arg Thr Pro Leu Gly Gin Arg He Arg Arg Ala Phe val Ala Glu Ala 
595 600 605 

Gly Trp Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu -Arg Val 
610 615 620 

Leu Ala His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu 
625 630 635 640 

Gly Lys Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro 

645 650 655 

Pro Glu Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn 
660 665 670 

Phe Gly Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu 
675 680 685 

Ala He Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin 
690 695 700 

Ser Phe Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly 
705 710 715 720 

Arg Lys Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val 
725 730 735 

Pro Asp Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg 
740 745 . 750 

Met Ala Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys 
755 760 765 

Leu Ala Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg 
770 775 780 

Met Leu Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala 
785 790 795 800 

Arg Ala Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala 
805 810 815 

Tyr Pro Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp 
820 825 830 

Trp Leu Ser Ala Lys Gly His His His His His His 
835 840 

<210> 465 
<211> 27 
<212> DNA 

<213> Artificial 

"I 

<220> 

<223> Synthetic 
<400> 465 
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gacgtccttc ggggtgatga ggtggcc 27 



<210> 466 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<400> 466 



atcaccccga 


aggacgtcca 


QQaaaaQ 




27 


<210> 467 
<211> 2532 
<212> DNA 
<213> Artificial 








<220> 

<223> Synthetic 








<400> 467 
atgaattccg 


aggcgatgct 


tccgctcttt gaacccaaag 


QccQQcrtcct cctcrcrtQaac 


60 


ggccaccacc 


tggcctaccg 


caccttcttc gccctgaagg 


Qcctcaccac craoccQcrcrac 


120 


gaaccggtgc 


aggcggtcta 


cggcttcgcc aagagcctcc 


tcaaocrccct ciaaaaacraac 


180 


gggtacaagg 


ccgtcatcgt 


crcrtctttoac occcracroccc 


cctccttcccf ccAcaAaacc 


240 


tacgaggcct 


acaaggcggg 




ar't"fT'n<^r«r*n' rrr* a or* horror* 


300 


ctcatcaagg 


agctggtgga 








gcggacgacg 


ttctcgccac 


cctggccaag aaggcggaaa 


aggaggggta cgaggtgcgc 


420 


atcctcaccg 


ccgaccgcga 


cctctaccaa ctcgtctccg 


accgcgtcgc cgtcctccac 


480 


cccgagggcc 


acctcatcac 


cccgaaggac gtccaggaga 


agtacggggt gcccccggag 


540 


cgctgggtgg 


acttccgcgc 


cctcacgggg gaccgctcgg 


acaacatccc cggggtggcg 


600 


gggatagggg 


agaagaccgc 


ccttcgactc ctcgcagagt 


gggggagcgt ggaaaacctc 


660 


ctgaagaacc 


tggaccgggt 


aaagccggac tcgctccggc 


gcaagataga ggcgcacctc 


720 


gaggacctcc 


acctctcctt 


agacctggcc cgcatccgca 


ccgacctccc cctggaggtg 


780 


gactttaagg 


ccctgcgccg 


caggaccccc gacctggagg 


gcctgagggc ctttttggag 


840 


gagctggagt 


tcggaagcct 


cctccacgag ttcggcctcc 


tgggagggga gaagccccgg 


900 


gaggaggccc 


cctggccccc 


gcccgaaggg gccttcgtgg 


gcttcctcct ttcccgcaag 


960 


gagcccatgt 


gggcggagct 


tctggccctg gcggcggcct 


cgggcggccg cgtgcaccgg 


1020 
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gcagcagacc ccttggcggg gctaaaggac ctcaaggagg tccggggcct cctcgccaag 1080 

gacctcgccg tcttggcctc gagggagggg ctagacctcg tgcccgggga cgaccccatg 1140 

ctcctcgcct acctcctgga cccttcgaac accacccccg agggggtggc gcggcgctac 1200 

gggggggagt ggacggagga cgccgcccac cgggccctcc tctcggagag gctccatcgg 1260 

aacctcctta agcgcctcga gggggaggag aagctccttt ggctctacca cgaggtggaa 1320 

aagcccctct cccgggtcct ggcccatatg gaggccaccg gggtacggcg ggacgtggcc 1380 

taccttcagg ccctttccct ggagcttgcg gaggagatcc gccgcctcga ggaggaggtc 1440 

ttccgcttgg cgggccaccc cttcaacctc aactcccggg accagctgga aagggtgctc 1500 

tttgacgagc ttaggcttcc cgccttgaag aagacgaaga agacaggcaa gcgctccacc 1560 

agcgccgcgg tgctggaggc cctacgggag gcccacccca tcgtggagaa gatcctccag 1620 

caccgggagc tcaccaagct caagaacacc tacgtggacc ccctcccaag cctcgtccac 1680 

ccgaggacgg gccgcctcca cacccgcttc aaccagacgg ccacggccac ggggaggctt 1740 

agtagctccg accccaacct gcagaacatc cccgtccgca cccccttggg ccagaggatc 1800 

cgccgggcct tcgtggccga ggcgggttgg gcgttggtgg ccctggacta tagccagata 1860 

gagctccgcg tcctcgccca cctctccggg gacgaaaacc tgatcagggt cttccaggag 1920 

gggaaggaca tccacaccca gaccgcaagc tggatgttcg gcgtcccccc ggaggccgtg 1980 

gaccccctga tgcgccgggc ggccaagacg gtgaacttcg gcgtcctcta cggcatgtcc 2040 

gcccataggc tctcccagga gcttgccatc ccctacgagg aggcggtggc ctttatagag 2100 

cgctacttcc aaagcttccc caaggtgcgg gcctggatag aaaagaccct ggaggagggg 2160 

aggaagcggg gctacgtgga aaccctcttc ggaagaaggc gctacgtgcc cgacctcaac 2220 

gcccgggtga agagcgtcag ggaggccgcg gagcgcatgg ccttcaacat gcccgtccag 2280 

ggcaccgccg ccgacctcat gaagctcgcc atggtgaagc tcttcccccg cctccgggag 2340 

atgggggccc gcatgctcct ccaggtcgcc aacgagctcc tcctggaggc cccccaagcg 2400 

cgggccgagg aggtggcggc tttggccaag gaggccatgg agaaggccta tcccctcgcc 2460 

gtgcccctgg aggtggaggt ggggatgggg gaggactggc tttccgccaa gggtcaccac 252 0 

caccaccacc ac 2532 

<210> 468 
<211> 844 
<212> PRT 
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<213> Artificial 
<220> 

<:223> Synthetic 
<400> 468 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
15 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val He Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Tyr Glu Ala Asp Asp Val Leu Ala Thr Leu 

lis 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Val Ser Asp Arg Val Ala Val Leu His 
145 150 155 160 

Pro Glu Gly His Leu lie Thr Pro Lys Asp Val Gin Glu Lys Tyr Gly 
165 170 175 

Val Pro Pro Glu Arg Trp Val Asp Phe Arg Ala Leu Thr Gly Asp Arg 

180 185 190 

Ser Asp Asn He Pro Gly Val Ala Gly He Gly Glu Lys Thr Ala Leu 
195 200 205 

Arg Leu Leu Ala Glu Trp Gly Ser Val Glu Asn Leu Leu Lys Asn Leu 
210 215 220 

Asp Arg Val Lys Pro Asp Ser Leu Arg Arg Lys He Glu Ala His Leu 
225 230 235 240 

Glu Asp Leu His Leu Ser Leu Asp Leu Ala Arg He Arg Thr Asp Leu 
245 250 255 

Pro Leu Glu Val Asp Phe Lys Ala Leu Arg Arg Arg Thr Pro Asp Leu 
260 265 270 
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aiu Gly Leu Arg Ala Phe Leu Glu Glu Leu Glu Phe Gly Ser Leu Leu 
275 280 285 

His Glu Phe Gly Leu Leu Gly Gly Glu Lys Pro Arg Glu Glu Ala Pro 
290 295 300 

Trp Pro Pro Pro Glu Gly Ala Phe Val Gly Phe Leu Leu Ser Arg Lys 
305 310 315 320 

Glu Pro Met Trp Ala Glu Leu Leu Ala Leu Ala Ala Ala Ser Gly Gly 
325 330 335 

Arg Val His Arg Ala Ala Asp Pro Leu Ala Gly Leu Lys Asp Leu Lys 
340 345 350 

Glu Val Arg Gly Leu Leu Ala Lys Asp Leu Ala Val Leu Ala Ser Arg 

355 360 365 

Glu Gly Leu Asp Leu Val Pro Gly Asp Asp Pro Met Leu Leu Ala Tyr 
370 375 380 

Leu Leu Asp Pro Ser Asn Thr Thr Pro Glu Gly Val Ala Arg Arg Tyr 
385 390 395 400 

Gly Gly Glu Trp Thr Glu Asp Ala Ala His Arg Ala Leu Leu Ser Glu 
405 410 415 

Arg Leu His Arg Asn Leu Leu Lys Arg Leu Glu Gly Glu Glu Lys Leu 
420 425 430 

Leu Trp Leu Tyr His Glu Val Glu Lys Pro Leu Ser Arg Val Leu Ala 
435 440 445 

His Met Glu Ala Thr Gly Val Arg Arg Asp Val Ala Tyr Leu Gin Ala 
450 455 460 

Leu Ser Leu Glu Leu Ala Glu Glu lie Arg Arg Leu Glu Glu Glu Val 
465 470 475 480 

Phe Arg Leu Ala Gly His Pro Phe Asn Leu Asn Ser Arg Asp Gin Leu 
485 490 495 

Glu Arg Val Leu Phe Asp Glu Leu Arg Leu Pro Ala Leu Lys Lys Thr 
500 505 510 

Lys Lys Thr Gly Lys Arg Ser Thr Ser Ala Ala Val Leu Glu Ala Leu 
515 520 525 

Arg Glu Ala His Pro He Val Glu Lys He Leu Gin His Arg Glu Leu 
530 535 540 

Thr Lys Leu Lys Asn Thr Tyr Val Asp Pro Leu Pro Ser Leu Val His 
545 550 555 560 



Pro Arg Thr Gly Arg Leu His Thr Arg Phe Asn Gin Thr Ala Thr Ala 
565 570 575 
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Thr Gly Arg Leu Ser Ser Ser Asp Pro Asn Leu Gin Asn He Pro Val 
580 585 590 

Arg Thr Pro Leu Gly Gin Arg He Arg Arg Ala Phe Val Ala Glu Ala 
595 600 605 

Gly Trp Ala Leu Val Ala Leu Asp Tyr Ser Gin He Glu Leu Arg Val 

610 615 620 

Leu Ala His Leu Ser Gly Asp Glu Asn Leu He Arg Val Phe Gin Glu 
625 630 635 640 

Gly Lys Asp He His Thr Gin Thr Ala Ser Trp Met Phe Gly Val Pro 
645 650 655 

Pro Glu Ala Val Asp Pro Leu Met Arg Arg Ala Ala Lys Thr Val Asn 
660 665 670 

Phe Gly Val Leu Tyr Gly Met Ser Ala His Arg Leu Ser Gin Glu Leu 
675 680 685 

Ala He Pro Tyr Glu Glu Ala Val Ala Phe He Glu Arg Tyr Phe Gin 
690 695 700 

Ser Phe Pro Lys Val Arg Ala Trp He Glu Lys Thr Leu Glu Glu Gly 
705 710 715 720 

Arg Lys Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg Arg Arg Tyr Val 

725 730 , 735 

Pro Asp Leu Asn Ala Arg Val Lys Ser Val Arg Glu Ala Ala Glu Arg 
740 745 750 

Met Ala Phe Asn Met Pro Val Gin Gly Thr Ala Ala Asp Leu Met Lys 
755 760 765 

Leu Ala Met Val Lys Leu Phe Pro Arg Leu Arg Glu Met Gly Ala Arg 
770 775 780 

Met Leu Leu Gin Val Ala Asn Glu Leu Leu Leu Glu Ala Pro Gin Ala 
785 790 795 800 

Arg Ala Glu Glu Val Ala Ala Leu Ala Lys Glu Ala Met Glu Lys Ala 
805 810 815 

Tyr Pro Leu Ala Val Pro Leu Glu Val Glu Val Gly Met Gly Glu Asp 
820 825 830 

Trp Leu Ser Ala Lys Gly His His His His His His 
835 840 



<210> 469 

<211> 27 

<212> DNA 

<213> Artificial 
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<220> 

<223> Synthetic 
<400> 469 

ctcgaggcgg gtaaacccca ggaggtc 



<210> 470 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic 

<4Q0> 470 

gggtttaccc gcctcgaggt gcccggc 



<210> 471 
<211> 2526 
<212> DNA 
<213> Artificial 

<220> 

<223> Synthetic 

<400> 471 

atgaattccg aggcgatgct tccgctcttt gaacccaaag gccgggtcct cctggtggac 60 

ggccaccacc tggcctaccg caccttcttc gccctgaagg gcctcaccac gagccggggc 120 

gaaccggtgc aggcggtcta cggcttcgcc aagagcctcc tcaaggccct gaaggaggac 180 

gggtacaagg ccgtcatcgt ggtctttgac gccgaggccc cctccttccg ccacgaggcc 240 

tacgaggcct acaaggcggg gagggccccg acccccgagg acttcccccg gcagctcgcc 300 

ctcatcaagg agctggtgga cctcctgggg tttacccgcc tcgaggtgcc gggctttgaa 360 

gcggatgacg tcctggctac cctggccaag aaggcggaaa aggaaggcta cgaagtgcgc 420 

atcctcaccg cggaccggga cctttaccag cttctttcgg agcgaatctc catccttcac 480 

ccggagggtt acctgatcac cccggagtgg ctttgggaga agtatgggct taagccttcc 540 

cagtgggtgg actaccgggc cttggccggg gacccttccg acaacatccc cggcgtgaag 600 

ggcatcgggg agaagacggc ggccaagctg atccgggagt ggggaagcct ggaaaacctt 660 

cttaagcacc tggaacaggt gaaacctgcc tccgtgcggg agaagatcct tagccacatg 720 

gaggacctca agctatccct ggagctatcc cgggtgcaca cggacttgct ccttcaggtg 780 

gacttcgccc ggcgccggga gccggaccgg gaggggctta aggccttttt ggagaggctg 840 

gagttcggaa gcctcctcca cgagttcggc ctgttggaaa gcccggtggc ggcggaggaa 900 
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gctccctggc cgccccccga gggagccttc gtggggtacg ttctttcccg ccccgagccc 960 

atgtgggcgg agcttaacgc cttggccgcc gcctggggcg gccgcgtgca ccgggcagca 1020 

gaccccttgg cggggctaaa ggacctcaag gaggtccggg gcctcctcgc caaggacctc 1080 

gccgtcttgg cctcgaggga ggggctagac ctcgtgcccg gggacgaccc catgctcctc 1140 

gcctacctcc tggacccttc gaacaccacc cccgaggggg tggcgcggcg ctacgggggg 1200 

gagtggacgg aggacgccgc ccaccgggcc ctcctctcgg agaggctcca tcggaacctc 1260 

cttaagcgcc tcgaggggga ggagaagctc ctttggctct accacgaggt ggaaaagccc 1320 

ctctcccggg tcctggccca tatggaggcc accggggtac ggcgggacgt ggcctacctt 1380 

caggcccttt ccctggagct tgcggaggag atccgccgcc tcgaggagga ggtcttccgc 1440 

ttggcgggcc accccttcaa cctcaactcc cgggaccagc tggaaagggt gctctttgac 1500 

gagcttaggc ttcccgcctt gaagaagacg aagaagacag gcaagcgctc caccagcgcc 1560 

gcggtgctgg aggccctacg ggaggcccac cccatcgtgg agaagatcct ccagcaccgg 1620 

gagctcacca agctcaagaa cacctacgtg gaccccctcc caagcctcgt ccacccgagg 1680 

acgggccgcc tccacacccg cttcaaccag acggccacgg ccacggggag gcttagtagc 1740 

tccgacccca acctgcagaa catccccgtc cgcaccccct tgggccagag gatccgccgg 1800 

gccttcgtgg ccgaggcggg ttgggcgttg gtggccctgg actatagcca gatagagctc 1860 

cgcgtcctcg cccacctctc cggggacgaa aacctgatca gggtcttcca ggaggggaag 1920 

gacatccaca cccagaccgc aagctggatg ttcggcgtcc ccccggaggc cgtggacccc 1980 

ctgatgcgcc gggcggccaa gacggtgaac ttcggcgtcc tctacggcat gtccgcccat 2040 

aggctctccc aggagcttgc catcccctac gaggaggcgg tggcctttat agagcgctac 2100 

ttccaaagct tccccaaggt gcgggcctgg atagaaaaga ccctggagga ggggaggaag 2160 

cggggctacg tggaaaccct cttcggaaga aggcgctacg tgcccgacct caacgcccgg 2 220 

gtgaagagcg tcagggaggc cgcggagcgc atggccttca acatgcccgt ccagggcacc 2280 

gccgccgacc tcatgaagct cgccatggtg aagctcttcc cccgcctccg ggagatgggg 2340 

gcccgcatgc tcctccaggt cgccaacgag ctcctcctgg aggcccccca agcgcgggcc 2400 

gaggaggtgg cggctttggc caaggaggcc atggagaagg cctatcccct cgccgtgccc 2460 

ctggaggtgg aggtggggat gggggaggac tggctttccg ccaagggtca ccaccaccac 2520 

caccac 2526 
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<210> 472 

<211> 842 

<212> PRT 

<213> Artificial 

<220> 

<223> Synthetic 
<400> 472 

Met Asn Ser Glu Ala Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val 
1 5 10 15 

Leu Leu Val Asp Gly His His Leu Ala Tyr Arg Thr Phe Phe Ala Leu 
20 25 30 

Lys Gly Leu Thr Thr Ser Arg Gly Glu Pro Val Gin Ala Val Tyr Gly 
35 40 45 

Phe Ala Lys Ser Leu Leu Lys Ala Leu Lys Glu Asp Gly Tyr Lys Ala 
50 55 60 

Val lie Val Val Phe Asp Ala Glu Ala Pro Ser Phe Arg His Glu Ala 
65 70 75 80 

Tyr Glu Ala Tyr Lys Ala Gly Arg Ala Pro Thr Pro Glu Asp Phe Pro 
85 90 95 

Arg Gin Leu Ala Leu He Lys Glu Leu Val Asp Leu Leu Gly Phe Thr 
100 105 110 

Arg Leu Glu Val Pro Gly Phe Glu Ala Asp Asp Val Leu Ala Thr Leu 
115 120 125 

Ala Lys Lys Ala Glu Lys Glu Gly Tyr Glu Val Arg He Leu Thr Ala 
130 135 140 

Asp Arg Asp Leu Tyr Gin Leu Leu Set Glu Arg He Ser He Leu His 
145 150 155 160 

Pro Glu Gly Tyr Leu He Thr Pro Glu Trp Leu Trp Glu Lys Tyr Gly 
165 170 175 

Leu Lys Pro Ser Gin Trp Val Asp Tyr Arg Ala Leu Ala Gly Asp Pro 
180 185 190 

Ser Asp Asn He Pro Gly Val Lys Gly He Gly Glu Lys Thr Ala Ala 
195 200 205 

Lys Leu ile Arg Glu Trp Gly Ser Leu Glu Asn Leu Leu Lys His Leu 
210 215 220 

Glu Gin Val Lys Pro Ala Ser Val Arg Glu Lys Ile Leu Ser His Met 
225 230 235 240 

Glu Asp Leu Lys Leu Ser Leu Glu Leu Ser Arg Val His Thr Asp Leu 
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